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Executive Summary 

This study involved the analysis of two very different types of data — 
simulated Thematic Mapper MSS data and dual-polarized X-Band Synthetic /^rture 
Radar (SAR) data. The first phase of the research examined the impact of the 
improved spatial and spectral characteristics of the Landsat-D Thematic Mapper 
data on computer-aided analysis for forest cover type mapping. The second part 
of the investigation examined, both qualitatively and quantitatively, the value 
of the SAR data for differentiating forest and other cover types, and assessed 
the utility of pattern recognition techniques for analyzing SAR data. 

The study site was located in Kershaw County, South Carolina, and 
contained a variety of forest and other cover types, including pine, mixed 
hardwood, tupelo, recently cleeircut areas (coning back into mixed hardwood) , 
pasture, crqpland, exposed soil, and water, Excelleit quality, cloud-free IMS 
(Thematic Mapper Simulator) data and color infrared photography were obtained 
by NASA on May 2, 1979 and again on August 29, 1980 from 20,000 feet altitude, 
thereby providing IMS data having a nominal spatial resolution of 15 meters. 
The data were spatially degraded to produce data sets having 15 x 15 m, 

30 X 30, m (to simulate Thematic Mapper data) , 45 x 45 m, and 60 x 75 m (to 
simulate Landsat data) spatial resolutions. 

The first j^ase of the analysis examined the relationships between spatial 
resolution and classification performance. This was follwed by a sizable 
effort directed at examining the relaticxiships between the numbers of wave- 
length bands used in classifications and the resulting classification 
performance, as well as the importance of different wavelength bands or 
portions of the spectrum on classification performance. The significance of 
different methods for develcping training statistics and the use of different 


V 


classification algorithms were also investigated. A method for economically 
developing a statistically reliable test data set was also defined during this • 
portion of the study. The final fiiase of the work with the TMS data involved 
an evaluaticm of Principal Conponents Transformations as an alternative to 
feature selection for reducing the dimensionality of the data. 

The X-band SAR data were obtained by NASA on June 30 r 1980 from an 
altitude of 60,000 feet. The images were digitized at J.S.C. and the two 
polarizations were digitally registered at LARS to produce a digital data set 
suitable for quantitative analysis. Initially, a detailed qualitative stu<^ 
evaluated the characteristics of the data and the potential for identifying 
various cover types on the dual-polarized (HH and HV) images. The final phase 
of the research involved a quantitative analysis of the SAR data which included 
conputer classificaticms using both per-point (Gaussian Maximum Likelihood) and 
contextual (Per-Field and SBCHO) classifiers. 

The results of the various classificatiais of both the IMS and the SAR 
data are sumnarized in numerous tables and figures throughout the report. Four 
appendices contain 118 tables showing the classificatiai performance results 
and the statistical evaluations of these results. The three major <A)jectives 
of this research, as well as the several minor objectives pursued, are defined 
in Section II. In addition to the discussions and sunmarizaticHis of the 
results and their significance that are contained in the bo^ of the report, 
Secticxi VI contains a conplete sumnary of the results and some recomnendations. 

Results of this research that are of particular significance include the 
• following: 

1. Use of higher spatial resolution data resulted in lower overall 
classification accuracies when the classificaticais were conducted 
with the standard per-point Gaussian Maximum Likelihood classifier 
(i.e. , 30 meter simulated Thematic Mapper data had lower overall 
classification performances than 80 meter simulated Landsat data) . 
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2. Differences in spatial resolution caused much greater differences 
in classification performance among forest cover types than amc«g 
agricultural cover types. (i.e, , Per-point classifiers produced 
similar classificaticai performances in agricultural cover types for 
the simulated Thematic Mapper and Landsat spatial resoluticsi data 
sets, whereas for forest cover types the classification performance 
of the TMS data was much poorer than for the Landsat data. This was 
due primarily to the increased spectral variability of the forest 
cover types in the TMS data as COTpared to the Landsat data.) 

3. Four v/avelength bands provided the best combination of good overall 
classificaticxi performance and minimum cortputer time, although 
slightly higher overall classification performances were obtained by 
using all 'IMS wavebands available. 

4. Overall classification performances of 85-95%, based on test data, 
were obtained for both the 1979 and 1980 IMS data sets when four or 
more wavebands were utilized in conjuiiction with the SBCHO 
classifier. 

5. Higher classification performances were achieved for the TMS data 
using a contextual classifier (SECHO) rather than Per-Point 
classifiers (L-2 Minimum Distance or Gaussian Maximum Likelihood) . 

6. Principal components transformation of the TMS data did not result in 
higher classification performance when using the SECHO classifier. 

7. Deciduous and coniferous forest cover types can be easily differenti- 
ated on the HH polarized SAR imagery, but not on the HV imagery. 

8. Pine stands and pastures cannot be effectively differentiated on 
either the HH or HV SAR imagery, in spite of the distinct differences 
in physical characteristics of these trfo cover types. 

9. Significant improvemaits in overall classification performance of the 
SAR data were achieved using contextual classifiers (Per-Field and 
SECHO) as coirpared to the QML per-point classifier. 

10. Since only one wavelaigth (X-Band) , represented two channels (HH 
and HV polarizations) of SAR data were available for analysis, 
overall classification performances of only about 65% were obtained 
witbi the SAR data. It is believed that additional wavelengths of SAR 
data would enable significantly higher classification performances to 
be achieved. 

11. SAR data to be used for conputer analysis in future projects (e.g. , 
multi-frequency, multi-polarization) should be obtained through an 
all-digital proc^sing syston in order to minimize between-channel 
spatial distortions in the final data set. 
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I. IMDRODUCTION 

Tremei.dous progress has been made over the past few years in demonstrating 
the potentials and limitations for utilizing Landsat MSS data and computer- 
aided analysis techniques for identifying and mapping various earth surface 
features, including major forest cover groups (deciduous and coniferous) and, 
in seme cases, individual forest cover types. The Thematic Mapper scanner 
system, launched on Landsat-D in July 1982 , has increased ^spectral and spatial 
resolution, as y.'ell as an increase in the nunher of channels, which should 
theoretically allc3w better and more accurate classification of ground features. 
Past experience with aircraft, Landsat, and Skylab MSS data indicates that the 
spectral characteristics (both location and width) of the wavelength bands on 
the Landsat-D Thematic Maj^r system should allow more accurate idaitification 
of forest cover types to be achieved using con^ter-aided analysis techniques 
(Coggeshall and Hoffer, 1973 ; Hoffer and Staff, 1975 ; Hoffer et al., 1975 ). 
The impact of the improved spatial resolution is not obvious, due to the 
interaction between the textural characteristics of some types of forest cover 
(e.g., large-crowned mature deciduous trees) and the spectral response of 
individual high resolution pixels (Kan and Ball, 1974 ; Sadowski and Sarno, 
3976 ) . This investigation was therefore directed at examining the impact of 
the improved spectral and spatial characteristics of the Landsat-D Thematic 
Mapper data on computer-aided analysis for forest cover type maH>ing. 

A second major phase of this investigation involved X-band Synthetic 
Aperture Radar (SAR) data. Radar systems have several unique advantages over 
optical systans. Such advantages include the capability to penetrate clouds, 
to be operated day or night, and to obtain imagery in which the tone and 
texture characteristics are related to the dielectric constant and physiognomic 
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properties of the cx>ver types present, The side-look angle of radar systems 
also prodices characteristics in the data that are not found in data from 
nuLtispectral scanner ^sterns. Because of the different and perhaps unique 
characteristics of radar data, the question was raised as to whether X-band 
radar systems could provide more effective data for differentiating forest 
cover types and daisity differeices than can be obtained using MSS data from 
the optical portion of the spectrum. Earlier work in the mid-1960's with 
K-band imagery showed that sotie vegetative cover types could be differentiated 
and that differences v/ere scanetimes apparent in dual-polarized data (Morain and 
Simmett, 1966, 1967). However, these early studies did not involve X-band 
data and did not indicate which polarization provided the best capability for 
discriminating among forest cover types. Further, none of the earlier work had 
involved the utilization of c€»iputer-aided analysis techniques. Therefore, in 
addition to the questicxi concerning the value of radar data for differentiating 
forest cover types and density differences, this investigation also was 
directed at evaluating the potential for using "standard" conputer classifica- 
tion techniques, previously developed for multispectral scanner data, for 
analyzing dual-polarized X-band radar data. 
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II. OBJECTIVES 


This research involved three primary objectives: 

1. To determine the impact of the spatial resolution characteristics 
of the Thematic Mapper MSS data on classification of forest cover 
types using con^xiter-aided analysis techniques. 

2. TO determinci the impact of the improved spectral characteristics 
of the Thematic Majper MSS data, as compared to Landsat I-III 
data, on the capability to accurately and efficiently classify 
forest cover types using catiputer-aided analysis techniques. 

3. TO evaluate the utility of dual-polarized, X-band synthetic aperture 
radar data for idaitifying and mapping variais forest cover types, 
and for determining differences in d^sity and condition of the 
forest cover. 

Each of these major objectives included several sub-objectives which can 
be defined as follows; 

la. To cOTipare classification performance of 30 meter (simulated Thematic 
Mapper) data to 80 meter (simulated Landsat) data, using a per-point 
classifier. 

lb. To ccapare classification performances, based on a per-point classi- 
fier, using data of four different spatial resolutions (15 m, 30 m, 

45 m, and 80 m) . 

l c. To evaluate the impact of spatial resolution on spectral variability 
of different cover types, with special en 4 >hasis on both forest and 
agricultural cover types, 

l d. To evaluate the effectivoiess of a contextual classifier (i.e. , 

SECHO) , as ccHiipared to per-point classifiers (D-2 Minin^n Distance 
and Gaussian Maximum Likelihood) , for classifying data of relatively 
high spatial resolution such as the 30 m data to be cAjtained by the 
Thematic Mapper, 

2a, To define the minimum number of wavelength bands needed to achieve 
an acceptable classification result, 

2b, To evaluate the importance of the different portiais of the spectrum 
for accurately classifying the various forest, agricultural, and 
other cover types. 
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2c. To determine whether different sub-sets of wavelength bands are 

needed to classify different cover types, or if a single combination 
of wavelength bands is adecjuate for all cover types, 

2d, To evaluate the impact of different methods of developing training 
statistics on the classification results (both overall and for 
individual cover types) , 

2e, To determine the impact of principal components transformations on 
overall and individual cover lype classification performances, 

2f . To determine the minimum number of principal component channels 
required to achieve satisfactory classification results. 

2g. To evaluate the impact of different classification algorithms, using 
30 meter simulated Thematic Mapper data, for both transformed and 
untransformed data sets, 

3a, To qualitatively evaluate the potential for differentiating forest 
and other cover types using dual-polarized X-^Dand SAR data, 

3b. To evaluate, qualitatively and quantitatively, the relationship 
betweai radar look angle and magnitude of the radar return, 

3c, To quantitatively determine the potential for classifying forest and 
other cover types using dual-polarized X-band SAR data and a 
Gaussian Maximum Likelihood classification algorithm, 

3d. To evaluate the impact of (iegrading the spatial resolution of SAR 
data on classification accuracy, 

3e, To determine the effectivaiess of contextual classifiers (i.e., Per- 
Field and SECHO) , as compared to a per-point classifier (Gaussian 
Maximum Likelihood) for classifying SAR data, 

3f . To ccmpare the effectiveness of dual-polarized X-band SAR data to 
that of IMS data for purposes of classifying forest and other cover 
types. 
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III. STUDY SITE DESCRIPTION 

Uie Study site is Icxated in Kershaw County in caitral South Carolina, 
situated on the escarpment between the Piednont platteau and the coastal plain. 
The geograEiiical location of the stu<fy site and the orientation of the flight 
lines used are shown in Figure 3.1. The area changes from a distinctly 
dissected regicm having m(x3erate topogra^iic variability in the north to a 
river bottom area of gently sloping terrain in the south along the Wateree 
River. The soils of the northern area are acid clays of low permeability. 
These grade into loairiy sediments in the river bottom area to the south. The 
more ipland soils of the south are characterized by higher sand fracticMis, The 
geomorphological diversity of the area results in a wide variety of vegetaticai 
cover classes, and there is also a considerable variability in spectral 
characteristics associated with each cover class. These ccmplexities make the 
area a prime choice for testing various remote sensing techniques, iSie area 
was selected by the U.S. Forest Service as one of two primary sites in the U.S. 
to be used in testing various remote sensing techniques having potential use in 
forest inventory (^rations. 

The southeastern portion of the study area has flat to very gently rolling 
topography which provides a minimum of oivironmental variability, with the 
result that single cover classes occupy large contiguous areas. The exception 
is water tupelo which requires a narrow range of water fluctuaticai levels and 
therefore occupies rather restricted areas. The major cover classes of the 
southern area are bare soil, pasture, crops, pine, pine-hardwood mix, hardwood 
(both old age and second growth) , water tupelo, clearest areas, marsh 
vegetation and water. The bare soil areas are generally associated with 
agricultural activities or are areas of recent clearcuts. Areas in crops are 





associated with a wide variety of ground cover conditiOTS, ranging from 
prijnarily bare soil to closed crop canopies, depending on the amount of time 
since planting. . Similarly, the clearcut areas vary in ground cover conditicxi 
depending on the length of the period since cutting. Areas of saturated soil 
and standing water in sane of the clearcuts increase the diversity of spectral 
characteristics associated with that information class. A considerable 
diversity in age classes exists for the pine stands and also for the 
pine-hardwood mix, with consequent variSticsis in canopy closure. The pine 
stands are generally planted slash or loblolly pine. The hardwood (other than 
the stands of tupelo) consist of mixtures of several species including 
sweetgum, black willow, and sycamore. The water class is primarily contained 
in the wateree River, although there are also some spectrally distinct ponds 
associated with a gravel mining operation in the southern portion of tine test 
site. . 

The northern area, being heavily dissected and having somewhat steeper 
terrain, contains cover classes which generally do not occupy large contiguous 
areas. The major cover classes are bare so^, crops, pasture,'’' pine, 
pine-hardwood mix, hardwood, clearcut, water, and urban. The pine areas vary 
in crown closure more in the north than in the southeastern region. The 
hardwoods are generally restricted to relatively narrow gully bottoms. Areas 
in crop and pasture are generally very small due to the size of areas suitable 
for agricultural practices. Most of the surface area in water is in the 
wateree Reservoir, therefore providing a ratio of the frequencies of boundary- 
to-nonh^Qiiiidary pixels very different from that in the south. 


s 
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IV. THEMATIC MAPPER SIMULATOR (IMS) DATA ANALYSIS 

A. Data Collection 
1. ms Data 

The 1979 MSS data used in this study were collected by the NASA NS-OOl 
Ihematic Mapper Simulator (IMS) on May 2, 1979 as part of NASA Flight Mission 
399, Table 4,1 shows the wavelength bands of the TMS scanner and the 
corresponding Landsat-D Thematic Mazier bands. The TMS data were obtained in 
mid-morning under cloud-free conditions from an average height above ground of 
19,500 feet (5,944 meters). At this altitude, the 2,5 milliradian IPOV of the 
NS-OOl scanner provided a 15.3 meter ground resolution element at nadir. 
Unfortunately, the 2.08-2.35 ym band (Channel 7) was inoperable at the time of 
the flight mission, but all other instrumentation was functioning normally. 
Color and color infrared photographs of excelloit quality were taken at the 

same time the scanner data were obtained. The j^otogra^is and documented 

% 

observations of ground conditions frcnn visits to the stucfy area provided the 
reference data for the study, as discussed later. 

In 1980, NASA attempted to obtain a near-simultaneous set of TMS and 
Synthetic Aperture Radar (SAR) data to be used in the analysis of a ccMtiDined 
data set and also to provide a second set of TMS data for evaluating the 
repeatability and reliability of the results obtained with the 1979 data, NASA 
Plight Mission No, 425 was flcwn on July 2 and 3 by the NC-130 aircraft to 
obtain NS-OOl TMS data. However, significant levels of cloud cover in key 
portions of the flight lines caused thei TMS data obtained to be of itarginal 
value. Consequently, on August 29, 1980 NASA Flight Mission No. 430 was flown, 
and resulted in a usable set of TMS scanner data, and color and color infrared 
photografiiy. The data were obtained between 10:00 and 11:00 A.M. from an 


Table 4.1. Channel configuration of the NS-OOl and Landsat-D Thanatic Mapper Scanner ^steras. 
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altitude of 21,000 feet (MSL) over the Camden test site. Hie data obtained 
from this mission was essentially cloud free in the i^uthern portion of Flight 
Line 1 (south of Camden) but there were varying degrees of cloud cover in the 
northern portion of Flight Line 1 and over Flight Line 2. As a result, 
analysis of the 1980 IMS data was concentrated on the area in Flight Line 1 
south of Caradei, All 8 channels of the NS-OOl scanner functioned properly 
diring the August 29 mission. Flight lines were flown from north to south, 
which simplified some of the subsequent data handling activities. 

2. Reference Data 

On-site examinations of the stu^ area were conducted three times 
throughout the stu^. The first set of reference data were cbtained from May 
10-15, 1979 in support of the TMS data obtained on May 2, 1979. ASCS 

photogra0iy was obtained and used for this initial site visit. The 

characteristics of the cover type were documearited at 84 locations throughout 
the test site and t±ese locations were .ioted on the ASCS photos and USGS maps. 
Detailed j,nformation concerning ground conditions at the various locations 
visited throughout the stu^ site are contained in the first quarterly progress 
report (June 1, 1979 - August 31, 1979) , LARS Contract Report 083179. 

In additicn to the 1:40,000 scale color and color infrared photography 
c^tained by NASA at the time of the NC-130 flight missions, larger scale 
photogra^y (1:12,000 and some 70 nm 1:6,000 and 1:2,000 color transparencies) 
were obtained from the USDA Rocky Mountain Forest and Range Experiment Station, 
courtesy of Mr, Robert Aldrich, These U.S. Forest Service 0iotos were (Stained 
in 1977 over selected porticxis of the study site and offered some information 
concerning the characteristics of the forest cover in the study area. 
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A second site visit was conducted from July 1-3, 1980, in conjunction with 
the radar mission on June 30 and the unsuccessful TMS data collection effort of 
July 2 and 3. The third visit to the test site was conducted from July 19-22, 
1981, for the purpose of evaluating results of the TMS classifications and the 
radar imagery analysis. For this last site visit, a number of areas had been 
defined ^luring the course of the analysis, and these were examined on the 
ground to verify the cover type characteristics. Both the second and third 
field trips included observation flights in a Cessna over the study area. 
These "birds-eye" vie^^ of the study area were particularly useful, in that 
some parts of the site were nearly inaccessible on the ground, and the aerial 
vantage provided an effective method for quickly comparing several test site 
locations in the data. These site visits also provided an opportunity for all 
personnel working with the data to become reasonably familiar with the test 
site and the characteristics of the cover types in the study area. Such site 
visits are absolutely necessary in this type of project and of tremendous 
benefit to the research personnel involved. 
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B. Date flartf l ^n g 

1. Reformatting 

The 1979 WS data had been flown from south to north so part of the 
reformatting process involved reversing both flight lines and individual scan 
lines so that they could be displayed with north at the tc^ of the image and 
without a mirror image effect in the individual scan lines. Apprc^riate 
ancillary data was also inserted into the header information for the data tapes 
at the time of the reformatting. 

2, Geometric Adjustment 

The variation in viewing angle (i.e. , +50° from nadir} inherent in 
aircraft scanner data^ results in geometric distortions in the data which hamper 
determiiation of in-place location and area estimates. The objectives of the 
geometric adjustment were to 1) produce a data set which corresponded 
geometrically to the USGS maps of the area and the aerial photography, in order 
to facilitate the location and identification of training and test fields and 
2) to provide a data set which would allow accurate area estimates to be 
obtained from pixel sunnaries. 

The criteria used in evaluating the quality of the geometric adjustmeit 
procedures were 1) whether the scale was consistent in each dimension 
everywhere in the data set and 2) equivalency of scale between the two 
dimensions (i.e., whether a fixed distance on the ground could oe accurately 
determined by a defined number of columns or lines of scanner data) . 

Note that the scale could be consistent in each dimension, but could still 
be very much in error in terms of actual ground din^sions involved. For 
instance, the original scanner data had a considerable distortion in 
equivalency of scale due to over-scanning. As a result, when each scan line 
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was displayed individually, dimensions along the flight line at nadir were 
approximately twice what they were across the flight line. 

Hie instantaneous field of view (IFOV) of the scanner, the average height 
above ground of the aircraft, and the change in scan angle corresponding to the 
analog signal sampling interval were employed to model the geometry that 
resulted from the variable viewing angle of the scanner optics. This provided 
a means for adjusting the across track distortions in the original scanner 
data. A program was written to adjust for the geometric distortions along each 
scan line, and 14 pairs of control points were established at random in the 
data set to evaluate the effectiveness of the geometric adjustment program. 
Both the consistency of scale in each dimension and the equivalency of scale 
between dimensicsis (i.e. , along track and across track) were evaluated by 
superinposing the control points (which were located on a 1:62,500 USGS map) 
onto the geometrically adjusted imagery using a Bausch and Lcmb Zoom Transfer 
Scope. The coincidence of all control points between the map and the scanner 
data indicated that the geometric adjustment had been successful. The details 
of the geometric adjustment procedure are given in the second quarterly 
progress report (September 1, 1979 - November 30, 1979) , LARS Contract Report 
120379, and in Latty (1981) . 

3. Radiometric Adjustment 

Changes in viewing angle of the scanner relative to the angle of incident 
radiant energy can provide a major source of variance in the spectral response 
values recorded. Examination of the 1979 scanner data indicated that there 
appeared to be distinct changes in response levels along individual scan lines, 
even though cover types did not change. These changes in reflectance 
associated with ciianges in viewing angle were confirmed by plotting average 
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refl^tance values by column over data blocks containing the same cover type, 
on a channel by channel basis. Tliese plots shoved that even though the cover 
type was the Same and there were no significant tc^gra£diic effects in this 
portion of the stu^ area, the average reflectance values were considerably 
different as a function of column in the data set (see Figure 4 .1) . These 
differences were therefore ascribed to scanner look angle/illumination angle 
effects. Software was then developed to radiometrically adjust the data in 
order to remove or reduce the variance in reflectance caused by changes in 
viewing angle which were extraneous to differences in cover types. 

For the 1979 data, four areas in the data set which appeared to have no 
across tracks stratification of cover type were identified, and a program was 
developed which computed the average reflectance by column for each channel 
over all of the scan lines in the designated areas, A regression analysis was 
then run for each channel using first, second and third degree polynomials. 
Evaluation of these results indicated that a third degree polynomial would 
provide an adequate fit to the data. Predicted reflectance values were then 
conputed for each column, ar^ for each channel. The predicted reflectance at 
nadir was divided the predicted reflectance of each column, for each 
channel, and the actual MSS response values were multiplied by this quotient 
and these radiometrically adjusted data, values were written onto another tape. 
The second quarterly progress report contains a more detailed discussion of the 
radiometric adjustment procedure, as well as a discussion concerning the 
theoretical considerations involved in such radiometric adjustment procedures. 
The method used to adjust the 1980 data set was somewhat different than 
that used for the 1979 data. In 1979, homogeneous bloaks covering the full 
width of the scanner data which appeared to have no acrcss^'track stratification 
of cover t^pe were idoitified. However, data blocks which fully met this 
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Figure 4.1. Actual and predicted radiometric response level values, as a 
function of column, for Channels 1 and 4, 1S79 IMS data. 
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criterion could not be defined in the 1980 data set. Therefore a method was 
devised which consisted of looking at homogeneous blocks of a single cover type 
which were located at regular intervals across the flight line, A set of 
columns, each of whicli was 20 pixels wide, was marked across the flight line 
and homogeieous blocks of old growth hardwood were located within each column 
group (see Figure 4.2). Figure 4,3 shows the coincident spectral plots of tiie 
old growth hardwood in the differoit column groups for each wavelength band, 
prior to radiometric adjustment. This figure clearly shows that the variation 
in spectral response as a function of look angle is much more important in the 
near infrared than the visible portion of the spectrum, and of relatively 
little importance in the middle or thermal infrared wavelengths. It also shows 
some irregular shifts in radiometric response in certain columns, probably 
caused by differences in the characteristics of the stands involved. 

The regression analysis was conducted using the same software that had 
been developed for the 1979 data set, and the data were adjusted using the 
empirically derived quotients. In evaluating the effectivaiess of this radio- 
metric adjustmoit procedure on the 1980 data, it was determined from the 
regression analysis that as one moved across the flight line, the X-variable 
(location across flight line) was not significant at an alpha level of 0.05. 
Tliis result indicated that the. radiometric adjustment had been successful in 
removing the effect of changes in view angle. Figure 4.4 shows an example of 
the unadjusted 1980 MSS data and Figure 4,5 shows the same area after it had 
been radiometrically adjusted* Details of the analysis of the 1980 radiometric 
adjustment procedure were contained in the eighth Quarterly Progress Report 
(March 1, 1981 - May 31, 1981), LARS Contract Report 053181. 

4. Spatial Resolution Degradation 

Due to the 2,5 milliradian IFOV of the NS-OOl niultispectral scanner and 
the average flying height of approximately 20,000 feet (or 6,560 meters) above 
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Figure 4.3. Coincidental spectral plots of the old growth hardwood in 
different column groups prior to radiometric adjustmait. 
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Figure 4.3, Continued. 
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Figure 4.4. Varian imagery of the radiometrically unadjusted 1980 MSS data 
(channel 5) . 
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ground, the original data had a nominal spatial resolution at nadir o£ 
approximately 15 meters. Neighboring pixels of the 1979 data were averaged 
together to provide data sets of approxinately 30 x 30 meters (corresponding to 
the proposed Thematic Mapper) , 45 <x 45 meters, and 60 x 75 meters (corr^pond- 
ing to the current Landsat data) , (The 60 by 75 meter data set is subsequently 
referred to as "80 meter" data, ht^lying a resolution approximating that of the 
Landsat MSS.) The averaging was unweighted due to an insufficient number of 
pixels to provide a continuous function required to siir;ulate the point spread 
function of each of the respective spatial resolutions. A separate tape file 
was constructed for each resolution from each flight line segment. Figures 
4,6a, b, c, and d are illustrations of small portions of the greyscale imagery 
in Channel 5 for each spatial resolution. These figures are rather dramatic 
exanples of the significance of spatial resoluticxi on the char act<ti. 1st ics of 
the data used to stu^ and map earth surface features. 
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Figure 4,6 (continued) b. 30 x 30 meter. 
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Figure 4.6 (continued) d. 60 x 75 meter. 




c. Evflluatiai «?£ gpfltial ResgjLutiQiLgn aasgification Pecfgnnange 


1. DeveXqpnient of Training Statistics 

This ^lase of the research was conducted using the 1979 data. Training 
statistics were developed using a supervised clustering approach, TV/o 512 x 
512 blocks of the .15 meter spatial resolution data were displayed on the 
OOMTAL Vision One/20, using data from channels 3r 4 , and 5 (0,63-0.69 urn, 
0.73-0,90 Mm, and 1,00-1.30 Mm, respectively). Areas representing each of the 
eleven cover classes referred to in the test site descriptiai were identified 
using the digital imagery and the 1:40,000 color infrared aerial photographs, 
and the line-column coordinates vzere recorded. FORTRAN programs were written 
to convert the line-column coordinates of the 15-meter ^tial resolution 
(XlMTAL image into the 15, 30, 45, and 80 meter spatial resolution coordinates 
of the MIST (Multispectral Image Storage Tape) . A total of 224 training fields 
were defined the analysis. Table 4.2 shows the nunher of training fields 
identified in each cover class and the average number of pixels per training 
field for each of the spatial resolutions. 

The reduction in sample sizes for the coarser resolutions was regarded as 
a natural consequence associated with coarser resolution data and, therefore, 
no effort was made to compensate this effect by providing a prc^rtionately 
greater number of training fields for the coarser resolutions. The relatively 
low number of pixels employed with the coarser spatial resolutioi data for 
developing training statistics using the supervised training field technique 
may have resulted in lower classification accuracies than would have been 
achieved using other training techniques that had previously been shown to be 
well suited for Landsat resolution data.^ However, using different techniques 


^Floning (1977) examined several training techniques and found an 
unsupervised clustering approach ("multicluster blocks") particularly well 


Table 4.2. The Number of Training Fields Defined for each 
Cover Class and the Average Nuirber of Pixels per 
Training Field for Each Spatial Resoluticxi 
(1979 IMS Data) . 


Spatial Resolution 


Cover 

No. of 

Training Fields 

iS 

Meter 

30 

Meter 

45 

80 

Meter 

Soil 

35 

223.0 

55 .6 

25 

11.0 

£^t 

r.T 

V •>>" 

75.7 

19.4 

8.0 

3.8 

Crop 

^4 

168.6 

42.5 

18.4 

8.9 

Pine 

16 

204.4 

50.3 

23.1 

9.8 

Pihd 

4 

318.2 

78.5 

35.7 

15,2 

Hdwd 

17 

926.2 

235.1 

104.8 

46.6 

Sghd 

16 

557,7 

140.1 

60.9 

28.8 


17 

82.0 

20.6 

9.1 

4.1 

Ccut 

22 

772 

194.4 

85.9 

40.7 

Mveg 

2 

596 

147.0 

65.0 

28.0 

Watr 

10 

182.7 

42.8 

20.3 

11.1 


Total 


224 


303.6 


76.3 


33.7 


15.5 
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to develop the training statistics would have added another variable to the 
classification accuracy ccsTfarisons, which was not desirable. 

The fields were grouped ty cover class and each cover class group was 

2 / 

clustered separately for each resolution,^ The cluster analysis resulted in a 
total of thirty-three spectral classes representing the elevoi cover classes. 
Table 4,3 shows the spectral classes defined and the nuntoer of pixels clustered 
into each spectral class, for the data of each spatial resolution. Pooling and 
deleting of cluster classes was avoided where possible to avoid introducing 
different analyst effects in the spectral classes associated with the data of 
each spatial resolution. One spectral class of water for the 45 meter data had 
to be deleted from the training statistics due to an insufficiait number of 
pixels to compute the covariances. The pair-wise separabilities of the 
spectral classes were examined across cover class, within each resoluticxi. 
Based on the class separabilities, the spectral classes were considered 
appropriate for classification purposes. 


suited for developing training statistics in using Landsat data. In this 
approach the analyst locates several blocks in the data. Each block contains a 
multiple of cover classes and cover class conditicais. The blocks are selected 
with the intention of representing all of the cover classes, and the variation 
of their conditions, contained in the area to be classified. Tlie blocks are 
then clustered independently, or in groups, depending on the size of the blocks 
and the dimension restrictions associated with the clustering program. The 
analyst then identifies the cover class corresponding to each cluster class. 
Employing such a ''multicluster blocks" technique with high resolution aircraft 
data was expected to result in pixels fran different cover classes being 
clustered into conmon cluster classes due to spectral similarities among areas 
v/ithin the differ ait cover classes. A pilot clustering of blocks of data 
containing several cover classes confirmed this expectation. 

2 / 

^The convergence parameter was set to 98.5 percent, which means the 
percent of pixels which are not reassigned in the last iteration of pixel 
assignment to the nearest (Euclidean distance) mean is not less than 98.5 
(Phillips, 1973). 
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Table 4.3. The Nunbeir oi' ^?ixels in each Spectral Class of each Cover 
Class, by Spatial Resolution. 


Spatial Resolution 


Class 

15 Meter 

30 Meter 

45 Meter 

80 Meter 

Tupe 1 

Sll 

139 

72 

27 

Tupe 2 

452 

104 

36 

20 

Tupe 3 

403 

99 

45 

21 

Mveg 1 

658 

158 

68 

29 

Mveg 2 

534 

136 

62 

27 

Crop 1 

598 

130 

58 

28 

Crop 2 

2887 

746 

312 

152 

Crop 3 

1003 

266 

127 

65 

Crop 4 

1227 

299 

126 

54 

Past 1 

432 

112 

37 

18 

Fast 2 

572 

164 

70 

61 

Past 3 

1154 

296 

127 

21 

Fast 4 

1233 

303 

137 

68 

Past 5 

419 

104 

36 

23 

Soil 1 

765 

375 

184 

83 

Soil 2 

1919 

909 

428 

187 

SoU 3 

1366 

662 

259 

114 

Pihd 1 

246 

72 

28 

16 

Pihd 2 

1015 

242 

115 

45 

Hdwd 1 

1159 

1319 

693 

335 

Rdwd 2 

1846 

1701 

656 

268 

Hdwd 3 

1043 

955 

418 

189 

Ccut 1 

771 

714 

335 

157 

Ccut 2 

1480 

1294 

562 

285 

Ccut 3 

1414 

1445 

634 

■ 280 

Ccut 4 

666 

732 

324 

122 

Sghd 1 

1597 

909 

428 

203 

Sghd 2 

1979 

817 

324 

139 

Sghd 3 

757 

396 

187 

93 

Pine 1 

1244 

356 

156 

85 

Pine 2 

1946 

429 

205 

72 

Watr 1 

925 

215 

* 

11 

Matr 2 

164 

39 

121 

53 


*Spectral class was deleted due to an insufficient nunber of observations 
to conpute the covariance. 
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2. DevelqpniOTit of Test Data Set (1979 TMS Data; Spatial Resolution Stu^) 

A set of test areas were defined independait of the areas used for 
training the classifier. Such a test data set provides an estimate of the 
classification accuracies expected to be achieved with data of each spatial 
resolution examined. Since the accuracy estimates were obtained in areas 
selected independently from the training areas, the classification accuracy 
estimates would apply to all pixels of the area classified which satisfy the 
test pixel selection criteria.- A method was developed which provided the test 
pixels for all four spatial resolutions simultaneously, and which provided a 
test pixel selection technique which avoided excessive analyst bias. 

The method onployed a line-column grid which was overlaid on the MSS data 
using the OOMTAL image display (see Figs. 4.7 and 4.8) . The use of such a grid 
constituted a systematic sample based on line-column coordinates, with sampling 
intervals of approximately 180 meters in the across-track dimension and 
approximately 450 meters in the along-track dimension. Since the variables 
being sampled (i.e. , cover class and the assigned label) would not vary 
systematically with respect to the MSS line-column coordinate r< 2 lative to the 
sampling interval, the estimates for the mean and variance provided by such a 
systematic sample could be considered to be unbiased (see Cochran, 1963; 
especially pages 206-230) . The grid was constructed such that candidate pixels 
located by the grid were mapped precisely between tiie different spatial 
resolutions. This provided a means of develcping test points for all spatial 
resolutions simultaneously and avoided any identifications of test pixels in 
one resolution from involving more than one pixel in a lower resolution. This 
was achieved using the smallest grid spacing which was integer divisible by the 
number of original data pixels averaged to compute the data values for each 
resolution (i.e., in the across-track dimension the number of pixels averaged 
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A OOMTAL Vision/One image of a portion of the flight line, 
overledd with the cxy.nputer-defined grid used to locate and 
evaluate test fielci-.. 


Figure 4.7 


A magnification of a portion of the same image shown in 
Figure 4.7. Magnification to this scale was used for most 
of the interpretation and ideitif ication of test fields. 


Figure 4.8 
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together were 2, 3 , and 4; therefore, the smallest number for which each 
resoluticxi provides an integer quotient is 12) . In the along-track dimensicxi 
the number of pixels averaged together were 2, 3, and 5, resulting in 30 being 
the smallest value with an integer quotient. The grid spacing was therefore 12 
columns by 30 lines, A PORTOAN program (QUD.FIN) was modified to generate the 
grid for display oti the CX3MTAL, The areas specified by the grid and associated 
with each resolution (the "candidate test pixels") were identified using 
channels 3, 4, and 5 of the 15 meter spatial resolution data and the 1:40,000 
color infrared aerial jiiotografhs. Only those candidate test pixels which 
contained a single cover class, and which the analyst could locate and identify 
with a hi.gh level of confidoice, were recorded as suitable test pixels. The 
test pixels were then mapped into the MIST coordinates of each resolution. 

The grid spacing used provided 1428 possible test pixels for each flight 
line. In the context of the anticipated frequency at which candidate test 
pixels would fail the inclusion criteria, this candidate test pixel sample size 
was considered sufficient to provide sensitive tests for classification 
accuracy conparisons, A total of 523 test pixels were found to be acceptable, 

3, Results of Spatial Resolution Evaluation 

The first results to be discussed are based upon the training data rather 
than the test data set. The reasons for this are that classification accuraq/ 
estimates based on training field pixels provides a "first look" at expected 
classification performance. High classification accuracies of the training 
field pixels indicates that the spectral classes are generally; 

1) statistically separable, 

2) represent no more than one cover class, and 

3) correspond to "natural" regions of concentration, in the 
measuraneut space, associated with the spectral characteristics 
of each of the cover classes in the training fields. 
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The classificaticm results for the training data set are sunmarized in 
Table 4.4 by cover class group and for each of tlie spatial resolutions. All ■ 
seven channels of data were used in these classifications. In order to 
evaluate the significance of possible differences in classification performance 
as a functicai of spatial resoluticm, a technique had to be defined which would 
adequately take into account the fact that there are different numbers of 
pixels involved for each of the four spatial resolutions for each of the 
different cover types. This was acconplished through the use of the harmonic 
mean, which is a weighted average, where the weight is proportional to the 
inverse of the relative magnitude of each elemait included in the average. The 
harmonic mean is, therefore, a mean value of lower magnitude than the 
arithmetic mean in every case where the elonaits are not equal (the harmonic 
mean equals the arithmetic mean where the elemoits are equal) . Ilie harmonic 
mean is regarded as more appropriate than the arithmetic mean for estinating a 
coninon variance among factor levels (e.g. , each resolution) sampled at 
different intensities, since the lowest sampling intensity has the greatest 
weight in determining the mean. 

The harmonic mean is confuted by: 
m , 
r=l "r 

where: 

HM = harmonic mean 

m = the number of elements included in the mean. 

= the number of pixels sampled in computing the 
proporticxi correctly classified using the r (th) 
spatial resolutican. 
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Table 4.4. Statistical Evaluation o£ Classification Performances by 
Cover Class for each Spatial Resoluti(xi (Training Field 
Pixels, Per-Point Ott. Classifier, 7 Wavebands of IMS Data) 


Cover 
Clagg. ' 

15 

Meter 

Tupe 

96.3^ 

Mveg 

94.7^ 

Crc^ 

94.8^ 

Past 

93.2^ 

Soil 

94.9^ 

Pihd 

83,7^ 

Hd^'d 

82.5^ 

Ccut 

79.3^ 

Sghd 

72.9^ 

Pine 

72.1® 

Watr 

79.1^ 


Spatial Resolution 
30 45 


Msfeeg, 

Msfc££ 

98,9® 

100.0® 

97.6® 

99.2® 

97.1® 

98.1® 

95.6® 

96.6® 

95.7® 

96.7® 

89. 8*^ 

91.6!^ 

88 .5*^ 

91.2^ 

87.0*^ 

89.7^ 

85 .1*^ 

91.3° 

81.1*^ 

82.9^ 

74.8® 

79.3®*^ 


80 

Mgtgr 

Harmonic 

Mean 

100.0® 

182.49 

100.0® 

150.64 

97.3® 

771.28 

97.4® 

503.43 

96.6® 

1019.80 

95.1*^ 

146.22 

93. 3^^ 

2092.56 

92. 4^^ 

2297.24 

96.3^ 

U83.66 

95.5° 

420.12 

82.9*^ 

232.17 


Dissimilar superscripts within each particular cover class denotes a 
significant difference at the a = o.lO level of confidence based on 
the Newman-Keuls ' range test conducted on the arcs in transformed 
proportions. The proportions are the relative rates of omission in 
classification. 
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In Table 4.4, as indicated, dissimilar superscripts within each particular 
cover class denote a significant difference between the various spatial 
resolutions at the a =0.10 confidence level. 

The PCC (Percent Correct Classification) levels achieved with data of each 
spatial resolution were rrot statistically different for water tupelo, marsh 
vegetation, crop, pasture, or bare soil. The POC levels achieved with data of 
the different resolutions were statistically different for old age hardwood, 
second growth hardwood, clearcut, and in some cases, for pine and pine-hardwood 
mix. 

The irregular classification accuracies associated with the water cover 
class are believed to be due to the inclusion of the inundated surface mining 
areas as water. These areas are borrow pits wMch contain ridges of spoil, and 
the older spoil surfaces are covered with vegetation. The pixels corresponding 
to these areas are consequently canposite measureraoits of the spatially 
weighted irradiances associated with each of the ground cover materials 
actually present. Thus, varying levels of "contamination" of the spectral 
characteristics of water with those of another cover class, is believed to be 
the factor responsible for the low classification accuracies achieved for 
water. The fact that nearly all of the misclassified water pixels were 
classified as a spectral class representing clearcut areas of inundated soil 
with standing vegetation tends to confirm the above scenario. It is of 
interest, however, that classifications conducted with 80 meter spatial 
resolution data appear to be more robust in the context of these levels of 
contamination. 

The greatest changes in PCC with, respect to spatial resolution occur with 
the forest cover classes. The differences in PCC among all spatial resolutj.ons 
were found to be significant at the a = 0.10 confidaice level for the old age 
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hardwood, clearcut, second growth hardwood, and pine cover classes. Classifi- 
cation accuracy for these forest cover classes increases with decreasing 
spatial resolution. While the pine-hardwood mix cover class ranged from 83.7 
to 95.9 percent correct classification with 15 meter and 80 meter spatial 
resolution data, respectively, these differences were not found to be 
significant at the a = 0.10 level of confidence. The lew change in PCC with 
respect to resolution for water tupelo as centred to that associated with 
other forest cover classes is probably due to the very distinct spectral and 
spatial characteristics of the water tupelo. 

The results shewn in Table 4,4 are perhaps more easily seen in Figure 4.9, 
which shows a response surface for each of the individual cover classes for 
each of the four resolutions tested. As shown by this response surface, for 
most of the forest cover types, classification performance tends to increase 
rather dramatically with a decreased or larger spatial resolution. On the 
other hand, mixed crop, pasture, mixed vegetation, soil, and tupelo have very 
high classification performances at all four spatial resolutions. (In 
considering the high classification performances shown here, cne must keep in 
mind that these results are for the training data cxily.) These results 
indicate that agricultural cover types may not be significantly iirpacted by the 
higher spatial resolution of Thematic Mapper data, but the classification 
performance achieved for forest cover types using per-point classification 
algorithms may be significantly (and adversely) affected by the higher spatial 
resolution of Thematic Mapper type data. 

Figure 4.10 illustrates the overall classification accuracies achieved 
with the per-point GML classifier using data of each of the four spatial 
resoluticns. The differences between the overall classification accuracies 
achieved with the data of each spatial resolution were found to be significant 
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Figure 4.9. Response Surface of Percent Correct Classification by Cover 
Class, for each Spatial Resolution (Training Field Pixels, 
Per-Point GML Classifier) . 
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I.IO. Overall Percent Correct Classification of Training Field Pixels 
by Spatial Resolution (Per-Point GML Classifier) . 
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at the a = 0.10 confidence level.^ This figure represents one of the key 
results of this project in that it dlesu:!^ shows that overall percent correct 
classif ication (ECC) tends to decrease with improved spatial resolution. That 
is, as the size of the area on the ground corresponding to a single pixel 
increases, overall classification accuracy is expected to increase. 

Further evaluation of the data for the different spatial resoluticxis 
indicated that the spectral variability from among adjacent pixels was much 
higher with the higher spatial resolutiai data sets. Such variation in 
spectral response level is clearly shown in Figure 4.11, which depicts the 
variation in spectral response for a single scan line in each of the spatial 
resolution data sets. Ihese graphs provide some insight as to why the 
classificaticxi performance at the 15 meter spatial resolution was sometimes 
much poorer than at the Landsat spatial resolution. At the 15 meter spatial 
resolution, pixels for a given cover type tend to have so much spectral 
variability that many pixels could be ^ctrally similar to a completely 
different cover lype. However, at the Landsat spatial resolution, the texture 
in the data tends to be averaged out within a particular pixel and the 
reflectance for that pixel, is a representation of the overall spectral response 
within the pixel area. Ihis overall or averaged spectral response is often 
sufficiently different for different cover types that pattern recognition 
eilgorithms can be used to effectively differentiate between the cover types 
involved. For example, the spectral response of Land^t resolution pixels of 
hardwood is sufficiently different from pine to allow effective 
differaitiation, whereas at the 15 m spatial resolution, some pixels within the 


^This test for significant differences between levels of percent correct 
classification used the Newman-Keuls* range test employing the arcsin transfor- 
mation of the perceit of correctly classified pixels. 



SPECTRAL RESPONSE LEVEL SPECTRAL RESPONSE LEVEL 




-METERS 


Figure 4.11. Variation in spectral response level with respect to distance 
in the across-track dimension for 15, 30, 45, and 60 meter 
sampling intervals. 
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harA/ood area nay actually fall partially on a shacSgw area between two tree 
crowns, possibly resulting in a spectral response similar to that of 
illuminated pine crowns. In such a case, this pixel within the hardWood forest 
area probably would be misclassified as pine. OJius, due to the greater 
spectral variability found among the individual pixels in the higher resolution 
data, many pixels are misclassified, particularly in the areas of forest cover 
(where spectral variability is higher than in the agricultural cover types) . 

The effect of spatial resolution on overall performance and on classifica- 
ticxi of the various cover types was next evaluated using the test data set. 
Again, all seven wavebands were used for the classification. 

The overall P(3C based on test pixels achieved using the "per-point" GML 
cl^sifier with data of each spatial resolution are illustrated in Figure 4.12. 
The differences between the POC levels achieved with data of each ^tial 
resolution were not found to be significant at the a = 0.10 level of confi- 
dence, The magnitude of the differences between classification accuracit 
achieved for training pixels and test pixels is much larger than the magnitude 
of the differences between PCX: levels achieved for data of each spatial 
resolution. This would indicate that the degree to which the training classes 
represent the entire area to be classified is a more important determinant of 
classification accuracy than is the resolution of the data with whj.ch the 
classifications are conducted. However, the training field pixels are 
considered to provide a more sensitive estimate of the canparative PCC levels 
achieved due to either spatial resoluticai of the data or the classifier 
employed, since the factors affecting the outcome are more nearly rcCitricted to 
the "resolution” factor, or the "classifier" factor, than when test pixels are 
used to conduct the cai^iarison. 
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Figure 4.12. rarrect classification obtained using data of 

r different spatial resolutions, based on test pixels. 
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Table 4,5 provides a suinraary of the statistical evaluation of the 
differences betweai data of each spatial resolution for each cover class. As 
indicated, when the evaluation is based on test pixels, only the POC obtained 
for a subset of the cover classes characterized fay large levels of spectral 
variability across adjacent pixels (i.e. , old-age hardwood and clearcut areas) 
are significantly different at a 0,10 a -level. The relatively small numbers of 
test pixels for some cover types, especially at the larger spatial resolutions, 
and the large differences in classification performance t»etween the training 
data set and the test data set would suggest that the test data set was not a 
sufficiently large sample in this case. Since the estimate of the variance of 
the transformed proportions is a constant, inversely proportional to the number 
of test pixels, tiie sensitivity to "real” differences between PCC is directly 
proportional to the square root of the number of text pixels. The estimation 
of PCC for the area classified is caught in the quandary of including a 
sufficiently large number of pixels to provide a sensitive test for "real" 
differences, and providing a sampling technique which assures that each test 
pixel satisfies the "sample" criteria. Thus, further evaluation of techniques 
for defining a test data set using appropriate statistical sampling procedures 
was necessary. 

Although these test data results were not as forceful as the results 
obtained with the training data set, the same traids are present in both 
results. Since the training data represent relatively large numbers of pixels 
of each cover type, it is thought that for the purpose of evaluating the effect 
of differait spatial resolutions on classification of known cover types, both 
the test and training data sets provide a reasonable basis for arriving at the 
following conclusions: 

1. The use of successively higher spatial resolution data resulted in 
lower overall classification accuracies when classif icatiais were 
conducted with a "per-point" OIL classifier. 
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4.5. Statistical Evaluation o£ Percait Correct Classificaticm 
Performance by Cover Class for each Spatial Resolution 
(Test Pixels, Per-Point GML Classifier ) J 


Spatial Resolution 


Cover 

Class 

15 

Uet&L 

30 

Meter 

45 

80 

Harmonic 
■ Mean 

T\;pe 

66.7^ 

55.6® 

55.6® 

66.7® 

9.0 

Mveg 

21.1^ 

26.3® 

31.6® 

31.6® 

19.0 

Crop 

69.7^ 

78.8® 

84.8® 

82.1® 

31.86 

Past 

86.7® 

92.9® 

92.3® 

100.0® 

13.52 

Soil 

87.5® 

85.9® 

81.7® 

86.9® 

62.97 

Pihd 

29.0® 

35.5® 

25.8® 

22.6® 

31.00 

Hdwd 

72.4® 

77.6®^ 

81.4^ 

81.4*^ 

156.00 

Ccut 

77.5® 

76.1® 

81.7®^ 

88 .4*^ 

70.59 

Sghd 

66.7® 

72.4® 

69.4® 

65.5® 

121.49 

Pine 

36.4® 

27.3® 

18.2® 

36.4® 

11.00 


'^Dissimilar superscripts within each particular cover class denotes a 
significant differaice at the a = 0.10 level of confidence based on 
the Newman-Keuls ' range test conducted on the arcsin transformed 
proport icxis. The proporticxis are the relative rates of omission in 
classification. 
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2. Higher classification accuracies were achieved with the "per-point" 
classifier using 60 x 75 rneter (as opposed to higher) spatial 
resolution data in cover classes associated with relatively high 
levels of spectral variability across adjacent pixels (i.e. , old-age 
hardwood, second growth hardwood, pine forest, and clearcut areas) , 

3. Differences in classification accuracies achieved with data of 
different spatial resolution were not significant (a = 0,10) for cover 
classes associated with relatively low levels of spectral variability 
across adjacent pixels (i.e, , pasture, crops, bare soil, or marsh 
vegetation) . 


In summary, although Thematic Maj^r data will undoubtedly be better than 
the current Landsat data frcxn a mensurational stan(%)oint, these preliminary 
results, showing a decreased classificati<»i performance with higher (e.g., 
smaller) spatial resoluticxi, tend to indicate that conveitioial per-point 
classification technig^jes may not be effective when using higher resolution 
data, particularly for areas involving classification of forest cover. Thus, 
classificatiai techniques such as "SECHO" (whj.ch utilizes the spatial 
variability in addition to the mean spectral response of an entire forest stand 
or agricultural field) , need to be tested and refined for potential use with 
Thematic Mapper data. 
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D. Evaluation Of Diffecent, Ambers and Combinations of Wavelength Bands on 

Classification Perfornance 

1. Introduction 

As indicated previously, a najor objective of this research was to 
evaluate the effect of using different numbers or combiratiOTis of wavelength 
bands on the classification results, with Landsat data only involving a 
maximum of four wavelength bands, there has been a tendency on the part of many 
analysts to siirply use all four channels in all classifications without 
worrying about the increase in computer time involved. However, with the 
advent of the Thematic Mapper on Landsat-D, it is anticipated that more concern 
will be e^qpressed about the number of wavelength bands to be utilized, siiK:e 
the classification time involved when using a Gaussian Maximum Likelihood 
classifier has been shown to increase logrithmically with increasing numbers of 
wavelength bands, with cxily a slight or perhaps no corresponding increase in 
classification performance after the inclusion of four or five wavelength bands 
(Hoffer and Coggeshall, 1973; Hoffer et al., 1975). Figure 4.13 shows an 
excellent example of these relaticnships. 

With Thematic Majper data, several questions can be raised concerning the 
nunber and combinaticxi of wavelength bands to be used in a classification, 
including: 

(a) What is the minimum number of wavelength bands needed to achieve 
a "satisfactory" classification result? 

(b) Are certain portions of the ^)ectrum more important than others 
in accurately classifying a variety of cover types? 

(c) Are certain particular combinations of wavel<aigth bands more 
important than others- in accurately classifying a variety of 
cover types? 

(d) Will differoit sub-sets of wavelength bands be needed to classify 
differait cover types, or will a single combination of wavelength 
bands be adequate for all cover types? 


Overall Accuracy (%) 



Computer Time (Minutes) 
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2. Transfonned Divergence Evaluation Using the 1979 Training Statistics 

The next major porticn of this research project was directed at answering 
the above questions. The first |hase of this work involved the 1979 data set. 
Supervised training fields were defined on the CXDWTAL Vision One/20 display, in 
conjunction with the color infrared photography and the field notes. Once the 
training fields had been ideitified, they were grouped according to cover 
class. Ohe cover class groups of training fields were then individually 
clustered to resolve tiie cover classes into a set of spectral classes. This 
provided training class statistic? corresponding to a set of spectral classes 
associated with each cover class. Clustering at this stage provided a means of 
defining training classes within each cover class that were bc3i;%d on the 
spectral characteristics of the data rather than some descriptive parameter 
that might be poorly correlated with the spectra^, characteristics being 
recorded by the scanner. 

The mean vector and covariance matrix coirputed for each of the spectral 
classes define the individual statistical density associated with each 
respective spectral class. A measure of statistical distance betweoi all 
pair-wise ccmibinations of the spectral classes provides information on the 
"separability" of these ^ctral classes. This "separability" represents an ^ 
priori estimate of the probability of correct classification (Swain, Robertson, 
and Wacker, 1971) for measurements provided by each channel or channel 
combination. Only pairs of spectral classes belonging to different cover 
classes are of interest, since lew separability between different spectral 
classes of the same cover class does not affect classification accuracy. 

Transformed divergence was used to conpute the separability. Divergence 
is defined as: 

Pl(x) 


dx 


( 1 ) 
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where: p,(x) = statistical density of 
spectral class 1 

P 2 (x) = statistical density of 
spectral class 2 

or conputationally, for the Gaussian multivariate case: 

D = ■^ tr Z 2 ) ( 1 : 1 ^ “ S 2 ^)J + 2 l 2 ^) (n>i ~ 

(mj^ - (2) 

where: Z is the covariance matrix and m is the 

mean vector associated with the respective 
spectral class, and 

tr (trace) is the sum of the diagonal 
elements. 


Since divergence increases without bound as the statistical distance 
between the two classes increases, a saturation transform is enployed, 
resulting in a measure (i.e. , transformed divergence) which corresponds more 
closely with percent correct classification* After a certain level of 
statistical difference has been attained, virtually no confusion exists between 
the two class densities, and percent correct classification "saturates" toward 
100%, The resulting transformed divergence is provided by: 

TD = 2000 [1 - exp(HD/8)J (3) 


There are some disadvantages to the use of transformed divergence as a 
measure of statistical difference betweei class densities,^ but because of 


*******★★*• 

^It should be pointed out that transformed divergence is not "metric" in 
multivariate normal distribution functicxis of non-equivalent covariance 
matrices (Wacker and Landgrebe, 1972). That is, a pair of class densities 
having non-equivalent covariance matricies yet having equal mean vectors could 
have a transformed divergence value of zero. Also, there is no estimate for a 
lower confidence limit for the regression relation betwew transformed diver- 
gence and percent correct classification (Swain, Robertson, and Wacker, 1971) . 


52 


relative conpitational efficienc/ it is used in lieu of the alternative 
measures. 

Transformed divergence (TD) values were con^wted for each pair of spectral 
classes representing different cover classes, for each channel and channel 
combination. Ihese mean pair-wise TD-values were then sorted for each set of 
combinations involving the same number of channels. The seven channel 
combinations providing the highest mean pair-wise TD-values were obtained. 
Additional programs were written to generate summaries of the mean TD-values 
for each pair of cover classes (i.e^ , over all spectral classes representing 
the cover class pair) and each cover class (i.e. , over all cover class pairs 
involving the cover class; j = 2) for these seven channel 

combinations. 

To define the optimum number of channels to use in a classification, the 

relaticaiship betwe^ cost of misclassification and the probability of error 

must be determined. Otherwise there is no meaningful way to compare 

classification cost to classificaticHi accuracy. It can be observed from Figure 

4.14 that the increase in transformed divergence (the correlate to probability 

of correct classification) drops off sharply after three channels, and very 

little is gained by using more than four channels. This result is similar to 

those obtained previously with the Michigan M-7, 12-channel scanner (Coggeshall 

and Hoffer, 1973), and the Skylab ISKJhannel S-192 scanner (Hoffer et al., 

1975). The shape of the relaticxiship shown in Figure 4,14 indicates tiiat 

transformed divergence increases logarithmically as the c»mbinaticwi level 

2 / 

increases linearly.^ The spread of the points representing the five highest 

******★★** 

2 / 

•^To simplify the follcwing discussicxis, "combination level" will refer to 
the number of channels involved in any particular set of channel cxxnbinations. 
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Figure 4.14. Averaged transformed divergence for the best five waveband 
ccfflibinations for each combination level. 
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ranked channel combinations for each couibinatiai level represents the 
difference between successively ranked averaged transformed divergence. As 
seen in Figure 4.14, the mean difference between successively ranked mean 
separabilities decreases logarithmically as the combination level increases 
linearly. flMs implies that the rank of overall mean separability as a feature 
selection criterion decreases in value as the number of features comprising the 
selected feature subset increases. 

Hie best combined sources of infomation for distinguishing betweei 
various cover classes need not have as a subset the best single source of 
informaticMi. This is indicated in Table 4.6, which shows, for example, that 
the single channel having the highest mean TD-value (i.e. , channel 6) is not 
included in the 2, 3, and 4 channel combination l^els having the highest mean , 
TI>“Values, By conparing Table 4,6 with Table 4.7, it can be deserved that the 
best channel or channel combination for each contoination level, on the basis of 
mean overall separability, is not necessarily superior on a per cover class 
basis. 

Examination of the transformed divergence data indicated that the channel 
combination with the highest mean separability for a particular ocxnbination 
level does not necessarily provide a greater separability for all cover class 
pairs than channel combinations of a lower combination level, when the 
combination of the Icwer level is not a subset of the canbination of the higher 
level. Examples of this relationship are: soil vs. water has a mean TD-value 
of 1942 in channel 6 and a mean H)-value of only 1824 in channel combination 
3,4; PIHD vs, CCUT has a mean TD-value of 1835 in channel 6 and a mean TD-value 
of only 1641 in channel coibinatic« 3,4; PINE vs. MVBG has a mean TD-value of 
1424 in channel 1 (the channel ranked third on the bcisis of mean overall 
TD-value) and the mean TD-value of 1182 in channel combination 3,4 (the number 
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Table 4.6. Channel conbinations, ranked by overall mean TD-value for conbina- 
tion levels one through six. 


COMBINATION LEVEL 


1 

2 

3 

4 

5 

6 

6 

3,4 

3,4,5 

1,3 ,4 ,5 

1,3 ,4,5 ,6 

1 ,2 ,3 ,4 ,5 ,6 

3 

3,5 

3,4,6 

3, 4,5 ,6 

2 ,3 ,4 ,5 ,6 

2,3 ,4 ,5 ,6 ,7 

1 

2,4 

3,5,6 

1,3,4,6 

1,2,3,4,5 

1,3,4,5,6,7 

5 

2,5 

2,4,5 

3,4,5,7 

1,3,4,5,7 

1,2 ,3, 4, 6 ,7 

2 

3,6 

2,4,6 

2,4,5,7 

3, 4 ,5 ,6 ,7 

1,2 ,4 ,5 ,6 ,7 

4 

4,6 

2,5,6 

2,3,4,6 

2,4,5, 6,7 

1,2,3,4,5,7 

7 

1,4 

1,3,4 

1,3,5,6 

1,2,3,5,6 

1 ,2 ,3 ,4 ,6 ,7 


Table 4.7. Best channels and channel conbinations by TD-value for eadi cover 
class. Upvalue is in parentheses. 


COMBINATION LEVEL 



1 

2 

3 

4 

soil 

3(1820) 

24(1941) 

256(1987) 

1346,2346,1356(1992) 

past 

6(1476) 

35(1878) 

345(1971) 

3457(1987) 

crop 

3(1390) 

34(1836) 

345(1971) 

1345(1991) 

pine 

2(1435) 

34(1780) 

346(1912) 

3456(1960) 

pihd 

2(1580) 

36(1883) 

356(1982) 

3456(1997) 

hdwd 

3(1688) 

34(1881) 

134(1933) 

2346(1952) 

sghd 

3(1691) 

35(1933) 

346(1960) 

1345,1346,2346(1972) 

ti^)e 

6(1658) 

34(1896) 

245,345(1979) 

2457(1992) 

syca 

5(1753) 

35(1979) 

345(1994) 

1345,1346,1356(1999) 

ccut 

6(1329) 

• 46(1707) 

356(1889) 

3456(1947) 

mveg 

4(1270) 

14(1739) 

134(1941) 

1345(1990) 

watr 

5(1853) 

25(1988) 

246,256(1999) 

1345,1346,1356(2000) 


SOIL, bare soil; PAST, pasture; CROP, row and cereal crops; PINE, pine forest; 
PIHD, pine-hardwood mix; HDWD, old age hardwood; SGHD, second growth hardwood; 
TOPE, water tupelo; SYCA, ^caraore hardwood; OCUT, clearcut areas; KVEG, marsh 
vegetation; WAOR, river water and quarry water. 
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one ranked channel combinaticm of all coni>inations involving two channels) . 
Die same relationship holds for many other cover class pairs. Such a 
relati€*iship was not found when the lower level channel ccmibimtion was a 
subset of the higher level channel combinatioi (as would be expected) . 

By increasing the combination level, the additional average separability 
achieved for each cover class varies greatly betwe^ cover classes and 
combination levels, but generally decreases logarithmically with increasing 
combination level. Figure 4;, 15 can be thought of as a "sepatability response 
surface.” The apparent lengtli of the lines connecting different combination 
levels of the same cover class is proportional to the added separabilitY 
resulting from the information in the additional channel. Note that the 
greatest incr^se in separability due to the addition of the second channel 
occurs witi\ second growth harA^ood, As one would expect, the smallest increase 
in separabililY occurs with that cover class with the highest single channel 
separability (soil, in this case). It should be noted that the lirsR 
connecting the different cover classes are present merely to indicate relative 
differences of separability and in no way iiiply any functional relationship. 

Figure 4.16 plots the maximum transformed divergence observed for each 
cover class in each conbination level. This displays the maximum separability 
attainable for each cover class if the waveband combinations were selected on 
the basis of each cover class TD-value alone. As is clearly shown, the 
specific waveband combination resulting in each particular TD-value for any 
given waveband combination level is not constant over the different cover 
classes. In comparing Figures 4.15 and 4.16, it is apparent that the shapes of 
the surfaces become more and more alike as waveband conbination level is 
increased, cuid are nearly idaitical in shape after conbination level 4. This 
indicates that the separability by cover class provided by the best overall 



Figure 4,15, Averaged transformed divergence provided by the best overall 
waveband combination by waveband combination level and cover 
class. 



Figure 4,16, Averaged transformed divergence provided by the best wavdaand ! 

combination for each cover class by waveband combination level i 

and cover class, | 

I 
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channel combinaticai (Figure 4.15) is nearly idaitical to the separability by 
cover class provided by the best channel conbination for each individual cover 
class (Figure 4.16) beyond waveband coinbinaticm levels of 4, Thus, the best 
four waveband combination, based on overall transformed divergence, should 
provide very close to the maximum classificaticxi accuracy for each individual 
cover type. However, if one were interested only in a particular cover type, 
high classification accuracy probably could be achieved using less than four 
channels of data. 

Based upon these results, therefore, one would not expect a conpiter-based 
classification employing more than four channels to provide much in^rovemeit in 
overall classificaticai accuracy. The highest overall mean separability was 
provided by channels 1, 3, 4, and 5 (0.45-0.52, 0.63-0.69, 0.76-0.90, and 
1.0-1 .3 ym) — two visible and two near infrared channels. Note however, that 
this channel combination did not always provide the highest mean separability 
by cover class nor by pairs of cover classes. 

It should be noted that results such as these are highly data and 
application dependevt. A different set of cover classes, or even a subset of 
the cover classes, could result in other channel combinations yielding higher 
or lower predicted classificaticai accuracies. For this reason, these results 
were further evaluated by ccanparing them to results obtained with a different 
set of training statistics developed by another analyst, which are discussed in 
the next secticai. Furthermore, the results discussed thus far have involved 
only predicted classification accuracies, based on the Transformed Divergence 
values of the training statistics. It was therefore in^ortant to evaluate 
different wav^^and ccxibinations using actual classific:aticxi results, both for 
training and test data sets. 
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3. Effect of Different Numbers and Combinations of Wavelength Bands on 
Classification Results 

The next phase of the investigation l:i'A’\ved co«$arisons among a large 
number of actual classification results using both the 1979 and 1980 data sets, 
in which different numbers of and ccmibinations of wavelength bands were 
utilized. Classificaticai of a second data set was desired in order to evaluate 
the repeatability and reliability of the results obtained from the first data 
set. In order to eliminate as many variables as possible, only the 30 meter 
spatial resolution data set was used in these evaluations, and only the 
Gaussian Maximum Likelihood (GML) algorithm was utilized, A single set of 
training and test statistics were developed for the 1979 data, and another set 
were developed for the 1980 data. Each set of test data was then used for all 
wavdaand canparisons for the particular data set involved. Because the 1979 
data had been obtained on May 2 but the 1980 data had not been obtained until 
August 29, there were some significant differences in the vegetative condition 
of the various cover types. It was thought that this might cause some 
differences in the results between the 1979 and 1980 data for the waveband 
evaluation p>rtion of the investigation, but the two data sets would also 
provide some indication of the importance of the various wavebands, based upon 
the repeatability of the results, 

a. Development of Training Statistics 

For the results of this phase of the investigation to be valid, it was 
important that an accurate, representative set of training statistics be 
developed. Previous work had shown that the method used to develop training 
statistics for Landsat data could cause differences in classification 
performance ty as much as 14%, based on evaluation test data (Fleming and 
Hoffer, 1977), In that stucfy, the Multi-Cluster Blocks technique was found to 
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be the best for achieving the highest overall classification performance. 
However r in the current stu(fy, it was iirqportant to evaluate the effectiveness 
of various wavelength bands and spectral regions for specific cover types, 
thereby indicating the need to use the "standard” supervised technique for 
developing the traiiiing statistics. To provide an additional evaluation of the 
different methods for developing training statistics, therefore, both 
techniques (i.e. , Sipervised and Multi-Cluster Blocks) were used and the 
results were coitpared. 

Hie training classes defined for this phase of the investigation and the 
number of spectral classes corresponding to each cover class are shown in Table 
4.8. Because the earlier work had indicated relatively siiell spectral 
differences between old-growth and second growth hardwood, these categories 
were grouped into a single "hardwood" category for the remainder of the 
investigaticxi. Additicxially, because the earlier work had resulted in mly two 
and four training fields being defined for mixed vegetation and pine/har^^ood 
mix, respectively, and due to the difficulty of defining additicmal areas of 
similar characteristics for use as test fields, these cxiver type categories 
were not used in the rennainder of the stu^. Separability of the i^ctral 
classes representing the different informational classes was verified by 
histogram plots of the training data, and further checked using transformed 
divergence values. The transformed divergence values indicated that in most 
cases a very high separability could be achieved for most channel combinations 
when utilizing three or more of the sevoi available channels of the 1979 IMS 
data set (1980 had 8 channels) . Seme potential difficulties did show up, 
however, such as a relatively low separability between a spectral class of 
pasture and one of clear cut, but for most channel combinations of four or more 
channels, even this oonfusicai did not appear to be significant* 
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4.8. Description of the cover classes and number of spectral classes 
within each cover class (1979 IMS data, waveband evaluatic« 

StU(ty) , 


Cover 

Number of 

Spsstral.CIflgges 

Description of Cover Class 

Tupe 

2 

Water tipelo; generally restricted to 

remnants of narrow ox-bow lakes and other 
areas of inundated soils. 

Crop 

2 

Row crc?)s and small grain crcps in varying 
stages of size, canopy density and 
maturity. 

Past 

4 

Pasture and old fields; plant cover varies 
from healthy, improved pasture grasses 
to senescent forbs and invader species. 

Soil 

4 

Bare soil areas associated with agricultural 
activities; varies in sand, clay, and 
organic material content as well as 
moisture content. 

Hdwd 

2 

Middle to old age bottom-land hardwood; 
mixed species, found in stands varying 
from very dense to stands with large 
inter-crown gaps. 

Ccut 

6 

Areas subjected to clearcut forestry prac- 
tices; ground cover comprised of dry to 
inundated soils with varying aiv.ounts of 
residual or regeneration vegetation. 

Pine 

3 

Pine forest plantations, primarily slash 
and loblolly; evoiaged stands at various 
stages of maturity. 

Watr 

4 

Water; includes the Wateree River, dark 
marsh water, and water associated with 
surface mining. 
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As mentioned above^r in addition to the supervised training data set, a 
second set of training statistics were developed using the Multi-Cluster Block 
(MCB) technique, in which several heterogeneous blocks of data are defined and 
each is clustered into several (perhaps 15-25) spectral classes. The cluster 
maps are then cortqpared to the aerial jAiotos and key spectral classes 
identified, while others are merged or deleted, as appropriate. A "MERGE 
STATISTICS" program is then used to combine spectral classes from the 
individual cluster blocks, and a single set of. training statistics representing 
the eitire study area is generated. This second set of training statistics 
provided an excellent opportunity to evaluate the effect of the different 
techniques for developing training statistics on classifier performance. 

b. Development of Test Data Sets 

Four separate methods for developing test data sets were evaluated during 
this study — one based upon an analyst-supervised set of test fields, and the 
other three based upon a stratified sampling procedure incorporating a grid 
systan with dimensions of 50 lines by 50 columns. 

The supervised test data set was selected by two analysts in such a 
fashion as to represent all major cover types present in the stu<^ site, and to 
obtain test data from throughout the study site in case there were any along or 
across-track variations which might still have been present in the data, even 
subsequent to the radiometric corrections aH>lied. Table 4.9 shows the number 
of pixels for each cover class selected by this procedure. The major draw-back 
of this approach is the possibility of analyst bias which may be involved due 
to, perhaps, an unconscious selection of only d^se, homog^eous areas of 
veurious cover types to use as test fields. 


Table 4.9. Con^arison among three techniques for defining a test data 
set using the 1979 data. 



No. of Test Pixels Using Each 

Technique 

Cover Tvoe 

Stpervised 
Test Fields 

Grid Intersection 
With One Test 
Field 

"Sanple Block 
Test Data” 

Tupelo 

210 

126 

118 

Cr<^> 

197 

133 

369 

Pasture 

124 

4 

350 

soil 

606 

261 

1006 

Hardwood 

3032 

8181 

7269 

Clearcut 

537 

163 

370 

Pine 

577 

1299 

775 

Water 


2a 

-■lOfl 

Total 

5447 

10195 

10557 

Percent of 
Total Flight Line 
Area 

2.4% 

4.5% 

4.7% 
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A procedure was therefore developed to define a set of test fields in the 
manner which was essentially free of possible bias introduced by the analyst 
doing the selection, Uiis procedure involved a grid system having a ^cing of 
50 lines and 50 columns, which was overlayed onto the IMS scanner data. Ituree 
possible methods for defining test data sets based upon use of this grid were 
examined. 

For the 1979 data, the grid yielded 78 intersection points in the data. 
The first method based on the grid involved use of a single pixel as a test 
field at each of the intersection points. However, such a procedure would not. 
generate a sufficiently large set of test data to provide an adequate 
evaluation of the classification result. In addition, previous ej^rience had 
sho^m that precise locaticxi of a single X-Y coordinate of MSS data on aerial 
fiiotos or vice versa is very difficult. For these reasons, this single pixel 
technique was not given further consideration. 

The second method based on use of the grid involved designating a test 
field in the URper left corner each grid intersectia.. Each test field 
would be as large a sample as possible of the cover type occurring at the 
intersection, up to a maximum of 25 lines x 25 columns. A Bausch and Lcmb Zoom 
Transfer Scope (ZOS) was used to transfer the grid intersection locaticais to 
the aerial 0iotos in order to identify the cover types. Details of these 
procedures were docuraaited in the eighth Quarterly Progress Report (March 1 - 
May 31, 1981) , JJRS Contract Report 053181, Implanentation of this grid 
technique in the 1979 data set could have resulted in a mximum of 78 test 
fields, each 25 rows by 25 columns in size, or a total of 48750 pixels. This 
maximum or best case situation would have resulted in 27,2% of the pixels in 
the flightline being used as test fields. However, any test field in conflict 
with previously designated training fields or cluster blocks was reduced in 
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size until the conflict was removed, and, of course, most test fields did not 
fall in a location where they could be designated as a full 25 x 25 pixel size, 
A sunmary of the number of pixels for test areas in each cover class that 
resulted from this procedure is shown in Table 4.9. As indicated, the actual 
number of test pixels obtained using this technique was 10195, or 4.5 percent 
of the total data. A significant problem with this procedure is indicated by 
the fact that sane cover types were poorly represented iii the test data set. 
This problem indicated a need for a different method of selecting test data in 
a statistically unbiased manner. 

The method determined to offer the best solution to the problems 
previously encountered in defining test data sets again involved the 50 line x 
50 column grid, and has been designated as the "Sample Block Test Data" 
technique. With this technique, a set of primary sample blocks, each of which 
was 25 X 25 pixels in size, were designated in the upper left corner of the 50 
line X 50 column grid. The analyst then defined one test field for each cover 
type or information class present within each 25 x 25 sample block. Each test 
field was defined so as to include the largest possible rectangle of the cover 
type involved, regardless of the density, condition, or other variability of 
the cover type present. It was believed that this procedure precluded most of 
the potential analyst bias that may be present in using a straight "supervised" 
approach, but would provide a reasonable sample of all cover types present, 
with the number of pixels representing each cover type, being approximately in 
proportion to the area of that cover type present in the flight line. Table 
4.9 shows the results of this aH)roach for defining a test data set for the 
1979 data. Each cover type appears to be reasonably well represented. 
However, it should be noted that because there is such a large amount of 
hardwood present in the study site, the hardwood cover type represents a large 
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propocticm of the test data, and therefore the overall classification results 
will tend to be dominated by the classification performance of the hardwood 
cover type, 

c. Classififation of the 1979 Training Data 

After development of the training and test data sets, they were evaluated 
using a Gaussian Maximum Likelihood (GML) classification and all seven wave- 
length bands. The results for the training data, defined using the supervised 
method are shown in Table 4,10, Such high classification performance indicates 
that all cover types defined for the 1979 data set are indeed spectrally 
separable. Note that such a conclusion is all that can be obtained from such a 
table of training data results — such a table cannot be used as an indication of 
overall classification performance throughout the entire flight-line. Table 
4,11 shows the training data classification results using only four wavelength 
bands (Channels 2, 4, 5, and 7), Use of only four bands still resulted in 
highly accurate classification results, thereby confirming the results shown 
previously in Figures 4,15 and 4,16, which were based on Transformed Divergence 
values of training data, and which had indicated that four wavd>ands should 
result in accurate overall classifications as well as accurate classifications 
of each of the individual cover types. 

Classifications of the training data using the Multi-Cluster Blocks 
approach were <±>tained, but cannot be shown in tabular form because in this 
technique each X-Y coordinate within the cluster block is classified 
independently. Map printouts of the training blocks were con^ared to the 
aerial photos, and appeared to provide highly accurate classifications. 
However, only the results using test data sets will provide an effective 
oc&i^rison between training techniques. Likewise, the results using the test 
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Table 4.11* Training Class Performance Using Four Channels \ 2 , 4, 5, & 7) of 1979 IMS Data 
on Supervised Training Statistics and a GML Classifier, 
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data set must be used tc evaluate the effectiveness of using different numbers 
of wavelength bands and differait combinations of wavelength bands. Therefore, 
the remainder of the classification results for the 1979 data are based only on 
tabulation of the results for the test areas defined by the "Sample Block Test 
Data" technique, described previously. 

d. Classification of the 1979 and 1980 Test Data Sets 

The wavdaand evaluaticxi stuc^ was rather involved, due to the large 
numbers of channel combinations we wished to evaluate in addition to the 
desired comparison between two separate sets of training statistics. Table 
4.12 is a summary table showin*^ the overall classificatiai performance along 
with the wavelength bands used for the various classifications. Since two sets 
of training statistics were involved, the feature selection algorithm often 
indicated different canbinations of wavdaands as the "Best 2", "Best 3", etc, 
Ttius, as shown in Table 4.12, there is considerable variation in the channels 
defined as the "Best n" wavdaand combinatiai for ilie two different sets of 
training statistics. (This also tends to indicate the "data-dependont" nature 
of these results.) 

A complete set of the classification performance tables (or "confusion 
matrices") and statistical sunroary tables for the waveband evaluation study are 
shown in Appendix A of this report. The classification results tables (Nos. 
2-28) are indicated by the table numbers shown in Table 4.12. Tables A-29-36 
of i^jpendix A contain the statistical evaluation surataries for this waveband 
evaluation stu(^. 

In order to provide some order in evaluating this mass of classification 
results, the initial phase of this discussicn conpares the test results based 
on the Si^Jervised and WCB (Multi-Cluster Blocks) training statistics using all 
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Tsble 4.12. Sunnary table of overedl clsissiflcation results^ table location, 
and channel subsets of the 1979 Waveband Evaluation: GNL 
algorithm, sample block test data. 


1 2 3 4 5 6 7 Training Statistics 

WAVHiAND 0.45“ 0.52- 0.63- 0.76- 1.00- 1.55- 10.4- 

OCttnWPlON 0.52 0.60 0.69 0.90 1.30 1.75 12.5 Si^ervised MCB 
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seven channels of the 1979 IMS data. It was thought that this would provide a 
"base-line" set of test results against which all other channel combination 
sub-sets could be ccmipared, and would also provide an initial basis for 
centring the two methods of developing training statistics. The renvainder of 
tile discussion on waveband evaluation phase of this stu(^ is divided into 
several sections as follows: 

(a) Comparison of the classification resuits obtained with different 
numbers of wavelength bands (i.e. , the "Best" 2 through 7 bands). 

(b) Con^iarison of different combinations of three wavelengtii bands, 
based on the 1979 test data set. 

(c) Comparison of differ e:it combinaticxis of four wavelength bands, 
based on the 1979 test data set. 

(d) Evaluation of the classification results for the 1980 test data 
set, using all eight and the "best 4“ wavelength bands. 

Tables 4.13 and 4.14 show the results of classifying the 1979 test data 
using all sevai wavelaigth bands, based on the Supervised training statistics 
and the Multi-duster Blocks training statistics, respectively. Since both 
tables are based on all sevoi wavelaigth bands, they represent the best 
possible classification accuracy one could expect using this data set and these 
sets of training statistics. Because these tables are based on a statistically 
defined set of test data, they can be considered to be representative of the 
classification performance throughout the eitire flight line area.^ 


^Conventionally, results are evaluated only on the basis of the relative 
rate of omission . Instances of omission are ttie non-diagonal row elanents of 
the error matrix. Omission is of primary interest to those concerned with the 
likelihood of an area "known" to be of the i(th) cover class being classified 
as some other cover class. The commission error is equally a part of the error 
frequency associated with a classification. Commission error is represented by 
the non-diagonal column elemaits of the error matrix. This index of error is 
•of interest to those concerned with the likelihood of an area being classified 


Table 4.13* Wavdaand Evaluation Classification Results Using All 7 Channels. (1979 TMS Data 
St 5 )ervised Training Statistics, OIL Classifier) 
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Table 4.14, Waveband Evaluation Classification Results Using Ml 7 Channels. (1973 TI'IS Data 
fCB Training Statistics, OIL Classifier) 








As can be seen, both sefa^: of -training statistics resulted in highly 
accurate overall classification results, although some of the individual cover 
classes had surprisingly lew classification performance. Differences between 
the two sets of training statistics resulted in distinct differences in 
classification performance for some of the cover types, such as tupelo (67.8% 
vs 83.9%), clearcut (64,9% vs 45.7%), and pasture (83.4% vs 61.4%). A 
statistical comparison (Newman-Keuls Multiple Range Test) indicated that the 
overall classification performances were significantly different (a - 0.10), 
and that among the individual cover types, only the pine and soil classes were 
not st-^tistically different. However, because there is so much variability 
from one cover type to another as to which set of training statistics provided 
the best classification, it is not clear that either method of developing 
training statistics is distinctly better than the otlier. Soone cases where the 
MCB approach was much better than the supervised, such as tupelo and crops, 
were quite surprising, and would seem to indicate that the supervised training 
data had not been adequately representative of tlie spectral characteristics of 
those cover t^'pes, 

A comparison of classif icaticMi results for the "Best n" (2 through 7) 
channel combinations was a key element in the wavdDand evaluation phase of this 
study. The "Best n" channel coirbination was based on the "Feature Selection" 
algorithm, which was based on a divergence algorithm, as discussed earlier. 


as the i(th) cover class when actually the area is in some otlier cover class. 
Both of these forms of misclassificaticai constitute a legitimate error. The 
problem of providing a meaningful index for evaluating a classification arises 
when the evaluation is conducted ijy cover class , since the use of either 
measure will result in the same coiqputed "overall" classificatioi performance. 
Ihe problem is most crucial when the two error caiponents are poorly 
correlated, which is often the case. Work is needed to determine a legitimate 
and effective methodology for combining the two error components. 
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Bie analyst can use this divergence algorithm to define the "Best n" channel 
combination based on the minimum divergence between any two spectral classes , 
thereby helping to define a channel combination that will improve the classifi- 
caticxi performance for those spectral classes that are hardest to separate. 
The analyst could alternatively ask for the "Best n" channel combiration based 
on average divergence^ which would indicate the channel ccmibination that should 
enable the best average classification to be obtained. After stxne initial 
evaluations of the data, it was determined that several canbinations of 
channels often provided the same average divergence values (especially when 
more than three channels were involved) , so throughout this phase of the 
research, the channel combinations used were defined on the basis of the 
irdnimura divergence values defined by the feature select processor. 

Tables 4.15 and 4.16 show summaries of tiie results^ by cover class as well 
as overall and average performance percentages, for the "Best 2" through the 
"Best 7" waveband combinations, for the Supervised and the MCB Training 
Statistics, respectively. (These summary figures were obtained from Tables 2-7 
and 15-20 in /appendix A.) As indicated in Table 4.15, the classifications with 
CMily two or three channels were much lower in both overall and average 
classification performance than when more than three channels were used. It is 
also noteworthy that the feature selecticn algorithm defined a completely 
different set of channels (or waveleigth bands) as the "Best 3" than had been 
defined as the "Best 2". Note also that when only two or three channels ar«| 
used, the classification performance of some of the individual cover types may 
be considerably lower than when four or more channels are used. When more than 
four channels are used, the classification (for individual cover types, as well 
as overall and average) tends not to change very much, although the highest 
accuracy is generally achieved when all seven channels are utilized. 


Table 4.15. "Best n" Channels Classification Results Sumnary for the Supervised Training 
Statistics (1979 IMS Data, GML Classifier, Sanple Block Test Data). 
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Table 4.16. "Best n" Channels Classification Results Sunmary for the MCB Training Statistics 
(1979 IMS Data, GML Classifier, Saii 5 )le Block Test Data). 
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Table 4.16 shows the same treids and genered results seen in Table 4.15, 
although the overall and average classification performances are generally 
lower lor the equivalent number of channels. (This coi^parison of overall 
classificatiCTi performances probably can be observed more easily using Table 
4.12.) Table 4.16 also indicates that some individual cover types, notably 
tupelo, were classified considerably better when at least five channels were 
used. 

As shown in Table 4.12 as well as in Tables 4.15 and 4.16, there appears 
to be no definitive combination of wavelength bands that provides a 
distinctively optimum classification, although there are observable differences 
between the two sets of training statistics. For instance, use of the 
Si^r vised training statistics resulted in the Thermal IR channel being used as 
one of the "Best 4", "Best 5", and "Best 6" channel combinatiais, whereas with 
the MCB training statistics, the Thermal IR channel was not included until the 
"Best 6" channel corobinaticxi was defined. 

In sumnary, it would appear that a combination of four channels would 
produce much better classification results, both overall and for the individual 
cover types, than when three channels or less are utilized. Furthermore, if 
more than four channels are used, there is no evidence to suggest that 
significant improvements in classification performance can be obtained. These 
statements can be made for both the Supervised and the MCB training statistics. 
Such statemaits also support the previous results shown in Figure 4.15 and 
4.16, even though those figures were obtained using an entirely different set 
of training statistics. 

The next phase of the wav^ijand evaluation stu<^ involved classifications 
beused on various ooirbinaticxis of three channels of data, lliese results are 
summarized in Tables 4.17 and 4.18. As shewn in Table 4.17, the overall 


Table 4.17. Classification Results Sunmary for Various Three Channel Conibinations and the 

Supervised Training statistics (1979 TMS Data, Qtti Classifier, Sair{)le Block Test 
Data) . 
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O^le 4.18. Classification Results Suninary for Various !Hiree Channel Combinations and the 
IKB Training Statistics (1979 IMS Data, (HL Qassifier, San^le Block Test Data) 
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results are generally quite different, with the "Best 3" channels defined by 
the Average Transformed Divergence having the best overall classification for 
three channels. Table A-29 in i^pendix A indicates that these overall 
classification results for different combinations of three channels are all 
significantly different. Table A-30 in i^jpendix A indicates significant 
differences amcng the various conbinations of three channels for individual 
cover types, and shows, for example, that without at least one channel in the 
reflective infrared portiai of the spectrum, water is poorly classified, 
whereas use of cmly the visible wavelengths enabled tupelo to be classified 
with much higher accuracy than with any other combination of three channels. 
In fact, the use of cnly the visible charfnels enabled tupelo to be classified 
with essentially the same accuracy as obtained when all sevoi channels were 
used. Pasture was classified very poorly when only visible channels were used 
but quite well when only the reflective infrared portion of the spectrum was 
involved. Both the visible and near infrared appear to be inportant in 
cbtaining a reasonably accurate classification of hardwood with this data set 
and the Si^^ervised training statistics. 

When using the MCB training statistics and different combinations of three 
channe].s, the results obtained were very similar to those based on the 
supervised training statistics, as shown in Tables 4.18, A-31 and A~32. One 
notable result on Table 4.18 involves the water class, which has extremely poor 
accuracy unless a reflective infrared channel was used in the classification, 
A similar result was shown for the supervised statistics, but it was not as 
dramatic an example of the importance of particular wavelength regions for 
accurate classification of some cover types. 

The fact that both sets of training statistics produced similar classifi- 
caticn performances indicates that the results obteined are largely a function 
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of the spectral characteristics of the various cover types rather than of the 
tredning statistics. 

ijlie next j^tiase of the stu^ involved analysis of various combinations of 
four wavelength bands. Tables 4.19 and 4.20 suninarize the results obtained, 
based on the Supervised and the MCB Training Statistics, respectively. Tables 
A-33, 34, 35, and 36 in i^ppendix A shew the four channel «:xjmbinations that are 
significantly different. Use of four channels produced rather accurate 
classificatiem resul.ts — much better than could be obtained with exUy three 
channels in general. With both sets of training statistics, the four channel 
ccmbinatiai that most closely simulates the Landsat wavebands provided the 
highest overall classification, perhaps in part because this wav^Dand 
combination seemed to be particularly effective in classifying har^oods, as 
well as tupelo, pine, and exposed soil. Thus, these results do not suggest any 
particular advantage to using wavebands in portiems of the spectrum beyond 
those to which Silicon detectors (used in Multi-Linear Array systems) are 
sensitive, at least if the primary purpose is differentiation araoig, and 
identificatiCTi of, various vegetative cover types. However, if c*ie is dealing 
with vegetative stress conditicxis or other cover types, there may be distinct 
advantages to using data from the Middle Infrared or Thermal Infrared porticxis 
of the spactrum. It is sinply a situation in which the condition of the 
various cover types and the data involved in this stu(^ do not show any clear 
indications that the Middle or Thermal IR portions of the spectrum are more 
important than the Visible and Near IR regions. Hewever, it is noteworthy that 
the wavelength bands on the scanner used in this stu<ty (and on the Thematic 
Mapper) in the Visible and Near Infrared regicsis are spectrally much narrower 
than the channels on the Landsat MSS scanners. ^H)erefore, the classification 


Table 4.19. Classification Results Sutmary for Various Four Channel 

Contoinations and the Supervised Training Statistics (1979 IMS 
Data, GML Classifier, Sample Block Test Data) . 
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Table 4,20, Classification Results Sumnary for Veirious Four Channel 

Combinations and thi MCB Training Statistics (1979 TMS Data 
GML Classifier, Sample Block Test Data) . 
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accuracy seen in these results may be due, at least in pact, to the spectral 
resoluticxi of the data being used. 

The waveband evaluaticn based on the 1980 data set was also conducted 
using both Si^rvised and Multicluster Blocks (NCB) training statistics. 
Initial classifications using all eight wavel^gth bands available and the 
"best 4" wavebands produced results that were generally similar to those 
obtained with the 1979 data, although the classificatic»i accuracies were 
generally somewhat lower. The overall classificatiai performance based on test 
fields was 88.5% for all eight wavebands and 82.8% for the "best 4" wavebands 
(1, 2, 3, St 6), using the Supervised training statistics, whereas with the MCB 
training statistics the results shewed 79.8% and 79.7% overall performance for 
all eight and the "best 4" (1, 3, 4, & 5) wavebands, respectively. The 
performance tables for these four classifications are shown in Appendix B, 
Tables 59, 62, 65, and 68. One of the most noticeable results using the 1980 
data set involved the very low classification accuracies obtained for tiipelo. 
These ranged from oily 17.9% to 20.0%, evai when all wavelength bands were 
used, and for either set of training statistics. It is interesting to note 
that with the Supervised training statistics, most of the misclassified tupelo 
pixels were being identified as regenerating hardwood whereas with the MCB 
training statistics the misclassified pixels were being idaitified as hardwood. 
In either case, the poor performance for tupelo is attributed to seasonal 
changes in the spectral characteristics of tipelo as conypared to other 
hardwoods. Early in the growing season, the tupelo has a distinct spectral 
response (particularly in the visible waveloigths) that is quite different fro i 
other hardwoods, whereas later in the summer, the spectral response for tupelo 
is similar to that of the other hardwood cover types. This differaice — or lack 
thereof — betweai tupelo and the other hardwoods in the 1979 and 1980 data sets. 
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respectively, could be clearly seen on the color ihfrared jinotography that had 
been dbtained in conjunctioi with the IMS data. 

Since the 1980 data shored generally sijnilar results to those dDtained in 
1979 for the four channel and the ail channel classifications, further wavdaand 
evaluatioi classifications were not (Stained using the 1980 test data set. 

?lie waveband evaluation results, based upon both sets of training 
statistics as well as both the 1979 and 1980 test data sets can be sumnarized 
as follows: 


1. Use of four wavelength bands produced considerably better classifica- 
tion results than when only two or three wavelength bands were 
utilized, 

2. Maximum overall classification performances were chtained when all 
wavelength bands were utilized. 

3. ®ie increase in overall classificaticxi performance when more than four 
wavelength bands were utilizid was minimal, therefore, indicating that 
an a^rc^riate set of fcyr wavelength bands provides the best 
combination of high classification accuracy and minimal computer time. 

4. Various three and four wavelength band corabinatieis using the 1979 
^ta set indicated the inpnrtance of both the visible and near- 
infrared portiois of the spectrum for accurately classifying various 
forest and other cover types. 

5. These results, which were primarily focused on differentiation of 
various types of healthy vegetative cover, did not indicate any 
particular advantage for using wavelength bands in portiais of the 
spectrum beyond those to which Silicon detectors (used in Multi-Linear 
Array systems) are sensitive. 

6. Different combinations of three or four wavelength bands caused 
significant differeices in classification performance of various 
individual cover types, but overall classification acairacies did not 
provide any distinct trends indicating that certain wavelength tends 
were superior to others, (e.g. , When using four waveband contoina- 
ticHTS, several different corabinaticxis produced similar overall 
classification performances.) 

7. Bie Supervised method of developing training statistics provided 
slightly better overall classification results than the Multi-Cluster 
Blocks technique for both the 1979 and 1980 data sets. It would 
appear that for situations where accurate, reliable referaice data 
(i.e. , "ground truth") is available over the entire study area and for 
data having fine spatial resoluticxi, the Supervised technique is 



87 


generally best. It is particularly useful for waveband evaluation 
studies involving different covet types. 

8. Overall classification accuracies based on the "best 3" wavebands 
defined the average transformed divergence values were signifi- 
cantly higher than those based on the "best 3" wavd^ands defined by 
the minimum transformed divergence values. 
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E. CcMParison Amona Three CL ossification Alaorit^ 

Tlie analysis results discussed thus far have primarily involved the 1979 
IMS data (untransformed, 30 meter spatial resolution) and the GML (Gaussian 
Maximum Likelihood) classifier. The next £tese of the study involved an 
evaluation of the results obtained from the GML classifier as compared to the 
L-2 Minimum Distance Classifier and the SBCHO (Supervised Extractiai and 
Classificatiai of Homogeneous Objects) classifier. Conparisons among these 
three classification algorithms were again conducted using the untransformed 
1979 IMS data, but in addition, the three classification algorithms were 
applied to the untransformed 30-meter 1980 IMS data set in order to evaluate 
the repeatability, and reliability of the results obtained using the 1979 data. 
One must keep in mind, however, that the 1980 data were obtained about two 
months later in the growing season than the 1979 data (August 29, 1980 ys June 
30, 1979), and that all eight cliannels of the NS-001 scanner were functioning 
satisfactorily when the 1980 data were detained, whereas the 1.55-1.75 ym 
channel had not been functional at the time the 1979 data were obtained. 

The L-2 Minimum Distance classifier is based on a relatively simple 
classification algorithm and is much faster than the GML classifier. The SBCHO 
algorithm utilizes both the spectral characteristics and the spatial 
variability in the data in making the classificaticai decision. In view of the 
results showing the decreased classification performance with snaller spatial 
resoli 4 ti<xi data, it was thought that the SFXHO classifier might provide a 
distinct advantage over per-point classifiers (such as the L-2 and GML) when 
working with the 30-meter TMS data. 

In view of the previous excellent results obtained using only four 
channels of data, it was decided to conpare the classification algorithms using 
the ’’Best 4" wavelength bands, in addition, all sev^ (1979 data) or eight 
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(1980 data) waveleigth bands would be used to obtain additional insight into 
the value of using all available wavelength bands as compared to a four channel 
subset. It was also decided to use both sets of training statistics for all 
con^arisons as a further test of the repeatability of the results. 

Table 4.21 shows a sunmary of all 24 classifications conducted for this 
phase of the research, tables B-38-49 of Appendix B show the individial 
classification performance results for the 1979 data, and Tables B”50-57 show 
the statistical analysis results for ti-is 1979 data. Tables B-58-69 show the 
classificaticxi results fcr the 1980 data, and Tables B-70-77 show the 
statistical analysis results for the 1980 data. 

In examining the results of these classifications, as summarized on Table 
4,21, it is apparent that in all cases, the results obtained with the L-2 
classifier are considerably less accurate than those obtained with either the 
GML or the SE]CH0 classifier, and that the GML results are less accurate than 
those obtained with the SECHO classifier. Tables B-50, 52, 54, 56, 70, 72, 74, 
and 76 indicate that the overall classification accuracies shown on Table 4.21 
have statistically significant differences (a = 0.10) betweoi each of the 
classification algorithms for every data set ccxnbination (i.e, , every oxnbina- 
tion of waveleigths and training statistics, and for both the 1979 and 1980 
data) I Thus, the SECHO classifier clearly provides significantly better 
classificaticMi results than can be obtained with per-point classifiers. 

Table 4.21 also shows that when classification results for tlie Supervised 
and Multicluster Block training statistics for the same mmber of channels are 
ccxnpared, the Supervised training statistics resulted in better classificaticn 
accuracies in all cases except for the SECHO classifier and the 1979 data. 
These differences due to the training statistics used were greater with the L-2 
classifier than with the GML or SECHO classifiers. 
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Table 4.21. Sumnary table of overall classificaticn results for 
the L2, GHL and SEXHO classifiers. (Untransformed 
1979 and 1980 IMS data. Supervised and MCB training 
statistics, sample block test data) • 


I) 1979-Untran sformed TOS Data 


Training Statistics 
and Channel Combination 


Cl^pifipr 


SuBeiyjgaa 

12 .. 


SBCHO 

Best 4 (CH'S 2, 4 ,5 ,7) 

81.8% 

88.1% 

90.0% 

All 7 Channels 

85.3% 

90.7% 

91.6% 

Mwlticlugtet.. BlwK 




Best 4 (CH'S 1,3, 4, 6) 

77.4% 

86.1% 

90.6% 

All 7 Channels 

81.4% 

88.7% 

92.3% 



1 




II) 1980 Untra nsformed TMS Data 
Training Statistics 


and Channercombination 


Cl^sifiec 


Supervised 

-12— 

OIL 

SBCHO 

Best 4 (CH'S 1,2, 3 ,6) 

75.3% 

82.8% 

85.9% 

All 8 Channels 

77.5% 

88.5% 

89.6% 





. Best 4 (CH'S 1,3, 4,5) 

67.6% 

79.7% 

84.6% 

All 8 Channels 

70.2% 

79.8% 

84.2% 
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It is also apparent in examining Table 4.21 that seven or eight channels 
of data did enable more accurate classification results to be obtained than 
when only four channels were used (except in tiie case of the 1980 data with the 
Multicluster Blocks statistics and SEO!0 classifier) . However, in many situa- 
tions, the difference in performance due to the larger number of channels used 
was only about 2%. 

It would appear, in general, that the best overall results can be achieved 
using the SBOiO classifier. However, the 1979 and 1980 results using the SBCHD 
classifier do not indicate the same trends in relation to the method of 
developing training statistics and the nmriber of channels involved. With the 
1979 data, the MCB method for developing training statistics was best, whereas 
in 1980, the supervised method was best (particularly when all eight channels 
were used) . 

The statistical analysis of results for individual cover types showed 
that, in goieral, there were significant differences between the L~2 and OIL 
and the L-2 and SBCHO classifiers, but that only the hardwood cover type 
consistently produced significant differences between the (31L and SEICHO 
classifiers, for both the 1979 and 1980 data sets. The tupelo generally had a 
much lower classificaticn performance in 1980 than was the case for the 1979 
data, which we believe is due to phenological differences, with the tupelo 
having a rather distinct spectral characteristic in 1979 (which resulted in a 
rather unique magenta aj^jearance on the color inf rared photos) , whereas at the 
time of year the 1980 data were obtained the tupelo was spectrally similar to 
the otlier hardwoods. The claarcut areas (or regenerating hardwoods) were also 
much more difficult to classify in the 1980 data than had been the case with 
the 1979 (^ta set. 
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In sumnary, the results of the oonparison among classification algorithms 
indicated that: 


1. The Ir-2 Minimum Distance algorithm produced significantly less 
accurate classifications than were obtained using either the GML or 
the SECHO algorithms, 

2. The SEICHO algorithm consistently resulted in higher overall classifi^ 
cation performances than were obtained with the GML algorithm, 
regardless of the data set or training statistics being utilized. 

3. Overall classificaticxi performances of 85-90%, based on test data 
sets, were obtained for both the 1979 and 1980 TNS data when four 
or more wavelength bands were utilized in conjuncticMi with the 
SEXMD classifier and either the Si?)ervised or Multi-Cluster Blocks 
training statistics, 

4. Ihenological effects caused distinct differences in spectral response 
for some cover types, especially tipelo, when comparing the 1979 and 
1980 data. 
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F. Effectiveness of the Principal Corewnents Transformation ln_Data Analysis 

llie next phase of this project involved Uie evaluation of the principal 
coivponents transformaticxi on classification performance. Sometimes the 
question has been raised as to why a "feature selection" procedure should be 
used to reduce the number of wavelength bands for classifying a data set, as 
exposed to simply using the first three or four principal con^jonents of the 
data. Both "feature selection" and principal cem^aonents are data dimaision- 
ality reduction techniques. The advantage of the principal coitponents 
transformation is that it is a very automatic procedure for reducing the 
dimensicHiality of multispectral data. However, there are various methods 
available for deL-.ning the statistics used to calculate the principal conp>nent 
transformaticxis. This phase of the research was corseted, therefore, to 
evaluate the use of principal component tr,ansforroations, as coirpared to 
selected wavelength bands of ivntransformed data, for classifying forest and 
other cover types, based on OMS data. 

A Karhunen-Loeve or Principal Conp>nent Linear Transformation was applied 
to the 1979 TMS data set, using a 4% sample of pixels (every fifth line and 
fifth column) to calculate the statistics, including a mean vector and 
covariance matrix. The Karhunen-Loeve transformation then calculates the 
eigenvectors (transformed components) associated, with this sample covariance 
matrix, ordered in such a way that a maximum amount of data variability is 
accounted for in descending magnitude along these ccwp)nents. One particular 
advantage of the K-L transformation is that it uncorrelates the data in 
N-dimensicxis, i.e. , the transformed ccanponents are mutually orthogonal, so that 
any redundancy of information caused by interband correlaticxis of the original 
channels is removed. Tables C-10 8 and 109 in Appendix C give the statistics of 
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the original THS data (sampled every 5th line and 5th cx>lumn) and the resulting 
eigenvectors (transfonned con^nents) and eigoivalues, respectively, calculated 
from the covariance matrix of %|ds sampled TMS data. The information content 
cussociated with the ordered transfonned components for the 1979 K-L trcuisformed 
data set is shown in the form of a bar graph in Figure 4.17. As can be seen, 
the first oofi^nents alone contains over 50% of the variance or information 
content in the data, and the first three oon^nents together contain 97.8% of 
the variance. 

A supervised set of training statistics was generated from the K-L 
Transformed data and the same set of 1979 sample block test areas used 
previously were again used in this jhase of the study. Ihe data were 
classified using the 12 , ®SL, and SECHO algorithms with the first 3 and 4 then 
the first 4 components. Results from these classifications were compared to 
those obtained from the "optimum" three and four channel subsets of the 
original TMS data (as determined by ^SEPARABILITY) and are summarized in Tables 
4.22 and 4.23. i^p^dix C includes the classification performance tables 
(Tables C-80-91) as well as tables of the statistical comparisons amcxig the 
results (Tables C-92-107) . 

In evaluating the results, it is apparent that the value of the K-L trans- 
formation is strongly influenced by the classification algorithm used, 
particularly when oily three channels of data are involved. Table 4,22 shows 
that when the L-2 algorithm is applied to the data, the classificaticm 
performances were better for the transfonned data, as compared to the 
untransformed data, for all cover types except water. Table C-95 indicates 
that these differences were statistically significant (a = 0.10} for all cover 
types except tupelo and water. However, with the GML and SECHO classifiers, 
use of the transformed data resulted in significantly better classification 
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Table 4.22. Conbined conparison table of the overall and individual cover 
class classification performcuices between the uiitransfonned OMS 
and the K-L transformed data for all three classifiers using 
"optiimim" three channel feature sets. 


C30VER 

DATA SET 

...M 



OASS 

DESCRIPTION 

L2 

GML 

SECHO 


Untransformed IMS (CH's 1^3,6) 

76.9% 

94.7% 

96.5% 

PINE 





K*-L Transformed Data (Components Ir2f3) 

89.0 

90.1 

91.2 



69.1 

77.8 

89.1 

HDMD 

Sene as above 

80.9 

85.9 

91.3 



45.8 

21.2 

22.0 

TUPE 

Sante as above 

50.8 

45.8 

52.5 



49.5 

68.1 

74.6 

CCUT 

Same as above 

61.1 

47.8 

50.8 



43.4 

62.3 

68.3 

PAST 

Same as above 

69.4 

80.0 

84.9 



27.6 

61.5 

62.9 

CROP 

Same as eibove 

89.7 

87.0 

87.3 



50.4 

89.8 

92.0 

SOIL 

Same as above 

75.2 

74.3 

70.6 



88.3 

88.0 

81.3 

HATER 

Sane as above 

87.0 

76.3 

73.0 


65.2 7 8.4 86.8 

OVERALL Same as above 


80.0 82.9 86.6 
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Conbined confarison table of the overall and individual 
class classification performances between the untransfonuid W 
and K-L transformed data for a] 1 three classifiers using 
"(^iinun" four channel feature sets. 


OCVBl 

CLASS 


DATA SET 
DESCRIPTION 


GML SECHO 


PINE 


Untransformed IMS (CH's 

K-L Transformed Data (Conp)nent8 l,2r3,4) 


85.5% 91.0% 92.9% 


HDWD 


Same as above 


lUPE 


Same as above 


OCUT 


Same as above 


JTAST 


Same as above 


CROP 


Same cis above 
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performances for some cover types but significantly worse classifications for 
other cover t 2 ,*pes (see Tables C-97 and 99). Somewhat similar results were 
found when four channels of data were used^ as shown in Table 4.23, although 
the differences between the untransformed and transformed performances 
generally are smaller, particularly with the l- 2 classifier. 

The overall classification performances are compared in Tables 4.24 and 
4.25. Table 4.24 shows that the transformed data resulted in significantly 
better performance wl^n the L-2 classifier was used for both the three and four 
channel situations. However, when the GML algorithm was used, the transformed 
data had a better overall performance for the three channel situation but the 
untransformed data was better with four channels. For the SEICHO classifier, 
there was no differ^ce for three channels and the untransformed data was best 
when four channels were used. Table 4.25 shows that the differences between 
classification algorithms generally were significant for either three or four 
channels and with either the untransformed or trcinsformed data sets. 

These results could be sunmarized as follo/s: 


1. !Ihe K-L transformation (with 4 components) gaierally incr^ised the 
ovcirall classification {lerformance of the L-2 classifier, whereas the 
overall classificLtion was significantly decreased for both the GML 
and SECHO classifiers. 

2. For individual cover types, the CML and SECHO performances tended to 
be rather similar — both would either increase or decrease by a similar 
amount for a particular cover class with a K-L transformaticair— whereas 
the L2 classifier tended to react in the opposite way; i.e. , when the 
GML and SECHO classification cover class performances decreased with a 
K-L transformation, the L2 increased, and vise versa (with the 
exception of the CCUT and WATER categories) . 

3. Qhe K-L transformation and the L-2 classifier inf>roved all cover class 
performances when using three channels (i.e., conponents) and most 
cover class performances when using four channels. 

. A K-L transformation and the OIL classifier improved some (i.e., half) 
of the cover class performances when using three channels 
(components) , but when using four channels the classification 
performances were often considerably better with untransformed data. 
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Table 4.24. Suninary table of overall classification performances comparing the 
untransformed TMS and the K-L transformed data sets foe all three 



classifiers. 

Data Subset: "Best 3" Channels or 1st 3 Comoonents 



Untransformed TMS^ 

Table 

K-L Transformed Data 

Table 

C;iassifier 

(Channels 1.3.6) 

Location 

(Coiponents 1.2.3) 

Location 


65.2^ 

(Table 80) 

80.0%^ 

(Table 83) 

GML 

78.4^ 

(Table 81) 

82.9*^ 

(Table 84) 

SBCHO 

86.8® 

(Table 82) 

86.6® 

(Table 85) 


Data Subset: "Best 4" Ch annels or 1st 4 Components 


Classifier 

Untransformed TMS^ 
(Channels 2. 4. 5. 7) 

<r li— 

Table 

Location 

K-L Transformed Data 
(CODDonents 1.2. 3. 4) 

Table 

Location 

12 

81.8® 

(Table 86) 

83.8*^ 

(Table 89) 

OIL 

88.1*^ 

(Table 87) 

84.6® 

(Table 90) 

SEX310 

90.0*^ 

(Table 88) 

87.0® 

(Table 91) 


^Significantly different overall classificaticxi performances between the 
untransformed and the K-L transformed data sets for each classifier is 
indicated by a differait superscript (based upon a Newman-Keuls comparison 
with a = 0.10) . 
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Table 4.25. Sunrery table of overall class performances for three algorithms (L2, (XL, 
SECHO) based upon four data sets. 


Overall Classification Performance (%) 
Data Set Description by Classifier (and Table Location) _ 





cm - 

SECHD 

3 Channels (1,3,6) r Untransformed 

65.2® 

(Table 80) 

78.4*^ (Table 81) 

86.8^ 

(Table 82) 

1st 3 Conponents, K-L Transformed 

80.0® 

(Table 83) 

82.9*^ (Table 84) 

86.6^ 

(Table 85) 

4 Channels (2, 4,5,7), Untransformed 

81.8® 

(Table 86) 

88.1*^ (Table 87) 

90.0° 

(Table 88) 

1st 4 Conponents, K-L Transformed 

83.8® 

(Table 89) 

84.6® (Table 90) 

87. 0^^ 

(Table 91) 


^Differ^t superscripts betweei columns of the same row indicate significantly different 
overall classification performances between classifiers (based upon a Newman-Keuls 
comparison with a = 0.10) . 
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5. In geieral, it appears that for classifications using fewer nunt>er of 
channels (features) than is qptimum for a particular data set (i«e. , 
the intrinsic dimensiCTiality of the data, which in this case is four, 
a K-L transformation will in 4 >rove overall and most cover class 
perfoimiances. However, if the number of channels is equal to the 
intrinsic dimensionality of the data, the original untransformed data 
appears to provide better class separability and subsequent 
classification performance. 
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Vc SYNTHETIC APQCTURE RAI»R (SAR) DRIA ANALYSIS 
A. Date CoUegtiffl 

Die second major ^iiase of this research project involved the analysis of 
the SAR data. The test site and the reference data used were the same as those 
involved in the TMS data analysis, and have alrea^ been described in Sections 
III and IVA3. 

Due to the aircraft schedules and equipment difficulties, we were 
unsuccessful in obtaining radar data during the 1979 growing season. However, 
Radar Mission No. 424 was successfully flown on June 30, 1980. This was the 
first (and only) radar data obtained in support of this proj(*ct. The sensor 
used was the APQ-102 side-looking synthetic aperture radar, flown in the NASA 
WB-57 aircraft at an average altitude of 60,200 feet MSL. Small scale 
(1:120,000 scale) color infrared photography was also obtained of the study 
Site as part of this mission. The photography indicated that the area was 
about 30-40% covered by cumulus clouds at the time the radar data were 
obtained. It might be worth noting, however, that the radar data showed no 
indication of the presence of clouds, thereby providing an excellent example of 
the fact that radar does indeed provide effective penetration of clouds 1 

The APQ-102 side-looking radar is a fully focused synthetic aperture radar 
imaging system. A horizontally polarized pulse of energy of 9600 MHz +5 MHz 
(i.e. , XHBand) was transmitted by the radar ^stem, and the returning aiergy 
was recorded on separate holograms as horizontally (HH) and vertically (HV) 
polarized responses. Diese holograms were then processed through an optical 
correlator by Goo<^ear Aerospace Corp. in Arizona, and the resulting images 
recorded on positive film, which was the format in which the data were provided 
by NASA to UiRS. 


!Hie positive-imge film was received at LARS on August 8r 1980. Black and 
white negatives and positive prints were then made of the radar film for 
handling and interpretation purposes. 

Visual examination of the imagery indicated that there was a very distinct 
dark band running the length of the imagery that was particularly distinct on 
the HH polarization but also fairly noticeable on the HV polarizaticsi (see 
Figure 5.1). It was also found that there was very little side-lap between 
Flight Lines 1 and 2. This lack of side-lap, in combination with the image 
quality dif fixities, caused the analysis of the radar data to be confined to 
Flight Line 1 for the area south of Camden along the wateree River and to the 
upland terradn in the regicai north of Camden. Hie radar data in these areas 
were of satisfactory quality in both polarizations. In additio;*i, the area 
south of Camden corresponded very well to the area covered by the cloud-free 
MSS data /i^tained in 1979 and again in August 1980. 

Because of the problems with image quality and the lack of overlap between 
f, light lines, detailed analysis of forest cover as a function of look angle 
(using the overlapping area of the two flight lines) could not be pursued with 
the radar data set obtained for this stuc^. 
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Figure 5.1. Radar images of Flight Line 1 for the HH and HV polarizations. 

The area for which MSS data were also obtained is outlined in 
white. 
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B. SAR Data Handling 
1. Digitization 

To convert the radar imagery into a nunKU'ical format, the positive film 
imagery was digitized using a microdensitoneter. Both the HH and HV polariza- 
tion images were digitized by the Lockheed Corporation at JSC, 

The parameters fca: digitizing the imagery were calculated using the 
specifications of the radar system and an approximate scale of the imagery, 
nie scale was determined by making several measurements between points on the 
radar imagery and USGS topographic maps. Acxx>rding to the characteristics of 
the system, the ground resolution for both the across track and alcng track 
resoliyzions was slightly less than 15 meters. This resoluticxi performance was 
therefore defined as the minimum allowable dimension for a ground resolutioi 
elemoit. Based on the 1:376,000 scale of the positive film image, it was 
determined that an aperture setting of 40 ym cxi the microdensitoneter would 
provide a digitized pixel having a spatial dimeision of 15 meters, thereby 
approximating the ground resolutioi of the SAR system. Both the sampling 
interval and scan line spacing were set at 40 ym to prevent any sidelap and 
overlap of adjacait pixels, thus providing independence between pixels. If 
there was any sidelap and/or overlap of the pixels, the variance between 
adjacoit pixels would have beoi reduced. This would not have allowed an 
effective conparison among various classification algorithms, since some 
algorithms are more sensitive to differ^ces in variance than others, and one 
of the basic assumptions of mcxst algorithms is that the individual samples are 
independait. 

Figure 5.1 shews the entire radar image of the test area for both the HH 
and LW polarizaticxis. On the HH £X)larization there is a distinctive dark band 
running through the entire flight line, covering approximately 30 parent of 
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the data set. The portion of the data covered by this dark band could not be 
used, so the final area digitized was approximately 2.7 cm x 11.7 cm, which 
represented an area of 6 miles by 27 miles on the ground. The 40 ym aperture 
setting resulted in 674 samples per line and 2897 lines of data, for a total of 
1,952,578 pixels. 

2 . Reformatting 

The digitized radar data were recorded directly onto 7-track tapes, which 
were later cqpied onto 9-ttack tapes in order to cxxivert the SAR data into 
LARSYS format. Some problems were aicountered in the quality of the digitized 
tapes because the same gain setting had been used to digitize both the HV and 
HH polarizations, thereby causing the HH data to be saturated in response. 
This was corrected by redigitizing the radar imagery, and in May 1981, the 
final set of digitized SAR data were received by LARS. 

Since the HH and HV Images were digitized independently, the data had to 
be overlaid (i.e, , share the same line and column-coordinates) before being 
combined onto a single LARSYS data tape. Initial attempts were made to overlay 
the entire flight swath of the two data sets using first and second order 
polynomials. A set of 19 control points were id^tified, randomly scattered 
throughout the data on each polarization using photo- interpretation techniques, 
and were checked using an image correlation program. The overall results frcsn 
the models were given in terms of RMS (root mean square) error RMS errors 

1 / 2 
The RMS error is an unbiased estimator of a for the model (Steel and Torrie, 

1980) . It is defined as: 
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This expressicm defines the accuracy of a single daservation. 


RMS = 



where, _ n-1 

L = sample mean, 

1. = ith observatiai, 
n = total number of observaticns. 
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less than 0.5 for both line and column coordinates were considered to give the 
accuracy needed for the image registraticxi prcx:ess (Stnith, 1980) . Tbe results 
of both the first and secx^nd order polynomials did not provide acceptable RMS 
errors. Examination of the data indicated that a cAirvilinear orientation with 
more than one inflection point existed in the along-trac^k direction between the 
data sets. This type of orientation may have developed through a combinaticai 
of variables such as caused by the dual receiving antennas of the APQ-102 radar 
^stem and electronic equipment instabilities. 

To compensate for the geonnetric variabilities, the data along the flight 
line was divided into four separate blcx:ks. Over 30 potential control points 
were located in each block using the procedures previously maiticMied, The 
bicjuadratic transformation was aH>lied to each blcx:k and RMS errors were 
calculated. Tcible 5.1 gives the RMS errors for each block. These results 
indicated that blocks Al, A2, and B1 could be overlayed to the desired level of 
accuracy using their associated transfcnnations. Although block B2 did not 
have an RMS error of less than 0.5, it was decided that the data in the block 
wcxild be overlayed using its derived transformation rather than divide the 
blcxik into smaller units or delete it from further analysis.^ 

To facilitate the develcpmait of the statistics for the SAR data, the 
blocks of overlayed data were combined into a single data set (i.e, , to 
simulate the original flight line) > The reconbining of the blcxjks was 
accxmplished by visually locating overlapping points and reassigning the 
starting line and column lcx::ations. 


^After the data was overlayed, it was determined that the registration of 
blcx:k B2 was extremely poor and at this point it was deleted from further 
analysis. 
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Tiable 5.1. Results of the biquadratic transformation for the four blocks of 
SAR data. 


Block 

Maximum Acceptable 
Linear Error 

Overall RMS Error 

£q1udh 

Iksrber of 
Aoc^ed 

ChesKpoints. 

A1 

1.5 

0.484 

0.487 

20 

A2 

1.5 

0.425 

0.491 

20 

B1 

1.5 

0.486' 

0.488 

21 

B2 

1.9 

0.639 

0.864 

15 


3. Gecmetric A^ustment 

After the registration process, a second SAR data set was produced having 
a reduced spatial resolution of 30 m. The purpose of this was two-fold; 1) to 
match the spatial resolution of the simulated Thematic Mapper data set, and 
2) to reduce the amount of speckle associated with the SAR data. The spatial 
resolutiai was degraded by averaging pairs of neighboring pixels together. 
Since the original digitized SAR data set had a spatial resoluticxi of approxi- 
mately 15 m, the averaging of cells of four pixels produced a degraded data set 
having a spatial resolution of 30 m. A separate data tape was then constructed 
for the 30 m SAR data set. The steps and considerations used to degrade the 
spatial resoluticai were similar to those used for the MSS data (Latti^ 1981) . 
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C. Image Interpretation Results 

Various forest c»ver types were identified on color infrared c^iotografAy 
taken at the same time the SAR data were obtained. The fc»:est cover types 
identified on the aerial phot;ography included old growth mixed hardwood, second 
growth mixed har<i«?ood, water tupelo, and pine (primarily slash and loblolly 
pine) . In addition, there were areas where the forest had been clearcut, as 
well as pasture areas, crop land, areas of es^sed agricultural soil, and water 
features that were identified on the jhotography. 

Following the photo interpretation, stands of the various forest and other 
cover types were located on both polarizaticxis of SAR imagery. The two 
polarized images then were analyzed to determine if tonal and/or textural 
differences existed between the cover types. The tonal characteristics were 
determined by evaluating the relative speckle for each cover type. The tonal 
and/or textural differences between the HH and HV polarized images then were 
compared and evaluated for each cover type. An attempt was made to determine 
wl:y particular differaices did occur. 

The initial analysis of the SAR imagery depicted a banding effect which 
was particularly noticable cxi the HH image, A much more subtle tonal variation 
that seemed to be related the range angle could be observed, particularly cxi 
the HV image. Both of these effects can be observed in Figure 5,1, wMch shows 
the data for both polarizaticxis of the entire flight line. Both effects had a 
significant impact on the ability of the interpreter to determine various cover 
types using the radar imagery alcaie. Both the banding and tonal variation 
effects were not due to any characteristics of the ground terrain, but were due 
strictly to variables inheroit in this particular data collection and 
processing system. Both effects were also quite evident on several other data 
sets obtained at the same time over the other flight lines. It should be 



no 


pointed out that the overall lack of contrast in the HV in&,gery may have been 
due to the parameters involved in obtaining euid processing this particular data 
set and not necessarily an inhereit characteristici of HV polairized imagery. 

Deciduous forest cover appears to have a characteristic light tCMie on the 
HH image, whereas on the HV image these deciduous areas have a darker toiie. 
Diis was most evident in the area of the alluvial plain where dense deciduous 
forest cover was located (see Figure 5.2). Ihe dense deciduous forest stands 
located in small ravines were identified on both polarizations die to their 
distinctive spatial patterns (see Figure 5.3). Ttjese patterns were highlighted 
because of the high response given by the deciduous forest cover growing within 
the ravines and perhaps also highlighted in part by the slcpes of the ravines 
per jgg acting as angular reflectors. Due to the contrast difference between 
the two polarizations these patterns were more distinctive on the HH image than 
on the HV image. 

One of the most distinct differences observed in the imagery was a 
difference between deciduous and coniferous forest cover that could be observed 
as a function of polarization. As shown in Figure 5.2, there is very little 
difference between deciduous and coniferous forest an the HV image. On the HH 
image however, the deciduous forest cover has a distinct light tone whereas the 
coniferous forest cover has a relatively dark tone. Thus, deciduous and 
coniferous forest cover can be easily separated on the HH imagery due to the 
distinctive tonal differences, even though these cover types are very difficult 
to separate on the HV imagery. 

Other features such as older clearcuts and fields having emergent vegeta- 
ticn tend to look very similar in both tone and texture on both polarizations. 
Although recent clearcuts are very dark in tone in both polarizations as 
compared to the surrounding forest cover, they are easier to separate from 
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Figure 5.3. Example of radar imagery indicating distinct appearance of vegetated 
ravines on HH polarization. 
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coniferous and mixed cover types on the HV imagery. Water and smooth bare soil 
features have a distinctive black appearance on both polarizations to the 
specular reflectance of the anitted radar signal away from the antama. 
However, by using the shapes and speckling characteristics of some agricultural 
fields, water and fields with bare soil usually can be separated. 

It should be noted that of the features identified on the color IR photo- 
grajhy, several could not be identified on Hie SAR imagery. Old growth and 
second growth hardwood stands could not be separated. Water tupelo was very 
ea^ to identify on the color IR photography because of its distinctive color, 
but could not be identified at all on the SAR imagery. Table 5.2 surmarizes 
the tonal and textural characteristics of the various forest and other cover 
types examined in this stu^. Examples of the tonal and textured character- 
istics are illustrated in Figure 5.4. A more detailed characterization of the 
appearance of the various cover types in each p»larizaticai is shown in Table 
5.3. Table 5.3 is an expanded version of the sumnary in Table 5.2, and 
provides additional information concerning the variability in appearance of 
some of the cover types. 

In summary, the qualitative analysis of the dual-polarized SAR imagery 
showed that certain forest cover features are more easily idaitified in one 
polarization than the other, while many non-forest features look very similar 
in both polarizatiois. Discriminating between coniferous stands and det^iduous 
stands was easier on the HH image than on the HV image. However, this does not 
infer that the HH polarized image is better. I5ie shadow and edge effect due to 
extreme differences in vegetation height help delineate the boundaries of 
clearcuts, and are much more prevalent on the HV image. Neither polarization 
is consistently better for idaiti^ing the various forest cover typjes examined. 
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Table 5.2. Tone and texture characteristics of various cover types in relation 
to polarization of the radar imagery. 


Tone' 


1 / 


Textured 


Cover Type 

Ifii 

m 

m 


Hcu:dwood 

white 

light gray 

grainy 

grainy 

Pine 

dark gray 

gray 

speckled 

speckled 

Nixed Pinc-Hardwood 

dark gray 

gray 

grainy 

speckled 

Clearcut 

dark gray 

dark gray 

grainy 

grainy 

Bottomland scrub 

dark gray 

dark gray 

speckled 

speckled 

Pasture 

dark gray 

dark gray 

grainy 

grainy 

Ehiergetit Crops 

dark gray 

dark gray 

grainy 

grainy 

Bare Soil 

black 

black 

smooth 

smooth 

Water 

black 

black 

smooth 

smooth 

^Tone: (A) black; 

(B) dark gray; (C) 

light gray; 

(D) white 



^Texture: (1) smooth; (2) grainy; (3) speckled 


(These letters or numbers indicate the examples of these descriptiois shown 
in Figure 5.4) 


The following points summarize the results obtained diring the analysis: 


1. Deciduous forest cover is easily identified on the HH image due to a 
distinctive light tone, whereas on the HV image these areas have a 
darker tone. (Figures 5.2 and 5.3) 

2. Ccxiiferous forest cover is dark in tone on the HH image and is some- 
what lighter in tone on the HV image. (Figure 5.2) 

3. Deciduous and coniferous forest cover are easily separated on the HH 
image due to their distinctive tonal differences, but are difficult to 
separate on the HV image. (Figure 5.2) 

4. Dense deciduous forest stands located in ravines are easily identified 
on both polarizaticHis because of the tc^xjgrajhical pattern being high- 
lighted by the response of the deciduous stands and partially high- 
lighted ty the slopes acting as angular reflectors. These patterns 
are more distinctive on the HH image than on the HV image. (Figure 
5.3) 
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5.3. Descciptions o£ cover types identified on X-^and SAR imagery. 




Tone 




Cover Type 

HH 


ItV 

Texture 

Coonents 


Hardwood 

Very light In tonej 

Medium In tone; 

Some speckle present 

Shadow will appear to the west of 

1 


light gray to white 

gray on near look 

on HH; slightly 

stands and edge reflections will 

k 


on Imagery 

aide, light gray 

smooth to grainy. In- 

appear on the east side of stands, 

1 



on far look side 

crease In speckle 

If non-f crested land Is adjacent. 

if 




on HV. 

Most stands appear around drainage 
ways, water ways or on bottom land. 

1 





Somewhat Irregular In shape. 

1 

Regenerating 

Gray to light grayj 

Gray throughout 

Grainy to speckled 

If forested land Is adjacent to the 

1 

%t 

Hardwood 

some areas may 

area 

on both the HH and 

clearout areas, the east side will 


appear almost white. 


HV Images 

be In shadow while edge reflections 
will appear on west side. Usually 

\i ■ 
1 





Irregular In shape and may have 
roads leading to stands. Blocks 
of trees may also be present within 

js 

K 

II 





eleareut area. 

1 ■ 
1- 

Recent 

Dark In tonej dark 

Varies In tone; 

Grainy; may have re- 

Same as Regenerating Hardwood. 

1 

Clearout 

gray on Image 

dark gray (almost 

latively large white 

i 



black) to light 

patches within area. 


11 

Pine 

Dark gray; young 

Gray In tone; 

Speckled; similar on 

If non-forested land Is adjacent to 

I 


stands and mature 

young stands 

both young and mature 

the clearout areas, shadows will 



stands similar In 

appear to be dark- 

stands. 

appear to the west of stands and 

n . 


tone. 

er In tone than 


edge reflections will appear on the 

i 



mature stands. 


east side of stands. Usually Irreg- 
ular In shape and may have roads 
leading to stands. 

1 

I 

i 

1 

Pasture 

Dark gray through- 

Dark gray to gray 

Somewhat grainy on HH 

Somewhat regular In shape; If sur- 


out field. 

In tone. 

to a more speckled 

rounded by forested land, the east 

1 




appearance on HV. 

side will be In shadow and edge re- 
flections will appear on the west 
side. Individual trees may be pre- 

1 





sent within the field. 

i 

u 

Bare Soil 

Black to dark gray 

Black to dark gray 

Fairly smooth to some 

Regular In shape. If surrounded by 

1 

■ i 


In tone. 

In tone. 

gralnlness; depends 

forested land, edge reflections 

1 




on TOW direction or 
emergence of crops. 

will appear on west side. 

1 

1 

j: 

Crop 

Light gray to white 

Light gray to 

Smooth to grainy de- 

Same as Bare Soil. 


In tone . 

white In tone. 

pending on the amount 
of crop cover present. 

. 

1 

I 

1 

w 

Water 

Black In tone. 

Black In tone. 

Smooth. 

Irregular In shape (lakes) or very 
curvollnear (rivers). Edge reflec- 
tion will appear on west border. 

t 

i 

j 

1 

Urban 

Light gray with some 

Gray with some 

Very speckled which 

No definite boundary; many roads 


white splotches. 

white splotches. 

deoreases as one 

converging In the same general 

i 




moves away from the 
center of urban area. 

vicinity. 

1 


5. Older clearcuts and fields having emergent vegetaticm tend to look 
very similar in both tone and texture on both polarizations. (Figure 
5.2) 

6. Water and smooth bare soil features have a distinctive black 
appearance on both polarizations due to the i^cular reflectance of 
the emitted radar signal away from the ant^:uia. (Figure 5.2) 

7. Ti^lo stands could not be distinguished from the surrounding hardwood 
forest on either the HH or HV imagery. 

8. Differences in stand deisity and size class of forest stands could not 
be defined on either the HH or the HV polarization of the SflR data. 

9. There is a distinctive banding effect on the HH image and a tonal 
variation related to range angle on the HV image which impact the 
ability of the interpreter to determine various cover types. These 
effects were also evident on other data sets of different flight 
lines. (Figure 5.3) 
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D. ciflssif icatiga fiesulte 

Hie next phase of tlie analysis involved conpitec classification of the SAR 
data. It was hoped that such a qucintitative analysis might allow differ aitia- 
tiai among cover types that could not be separated visually. Another objective 
was to determine if con^xiter-aided analysis techniques that had originally been 
developed for MSS data could be effectively utilized with SAR data. In 
cuSdition, the effectiveness of the SBCHD classifier was to be evaluated for 
potential use with the SAR data, since this classifier utilizes both the 
"spectral" and spatial (e.g., radar speckle) information content in the data. 

ZXte to the unique characteristics of the SAR data (as evidenced in part by 
the coherent speckle) , a supervised classification was performed. In order to 
conpare the SAR results with a classification of the INS data, training and 
test fields were identified in both data sets throughout the area south of the 
city of Camden. On the 30 m data, this area consisted of 300 by 250 pixels, 
representing an area of 6 by 5 miles. 

Both the training and test field locatiais were ideitified using the 
OOMTAL Vision/20 (a digital image display device) . To ideitify enough fields 
throughout the data set, each training and test field was limited in size to 
the "average" field size. The average field size was determined for each cover 
class by calculating the total area of each cover class and then determining 
the number of tracts of land that were represented by that cover class. 

After identifying fields within each cover class, the fields were randomly 
divided into their training and test groups. Hie training fields were then 
divided into spectral classes within each cover class, if possible, based on 
the tonal variation within each cover class. Histograms were developed to 
determine if there were a sufficient number of training samples to accurately 
represent each spectral class. Statistics (i.e. , mean vectors and coveiriance 
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matrices) were calculated for each ^)ectral class f<»: use by the classification 
algorithms. 

Since the StR data had a distinct tonal variation across the flight line 
on the HV image (due to system characteristics) , a statistical evciluation was 
performed to determine if the SAR training data should be separated into 
spectral classes based on the location of the individual fields across the 
flight line. To determine the significance of the tonal variation across the 
flight line^ the flight line was first divided into six discrete strips. 
Fields of the dominant forest cover class, which was the harA/ood class, were 
identified within each strip and their means and standard deviations 
calculated. Figure 5.5 illustrates tlie means and standard deviations for each 
strip for both the HH and HV channels. From this figure it is shown that the 
means are fairly uniform across the strips on the HH polarizati orr.. However, 
the means of the individual strips are increasing across the flight line on the 
HV polarization, thus graphically illustrating the tonal variation previously 
observed in the cross-track direction on the HV imagery. 

An analysis of variance was performed on the data to determine the 
significance of the tonal variation. The means of the strips for the HH image 
were found not to be significantly different at a = 0.05. However, the strip 
means of the HV image were found to be significantly different, therefore, 
based on the Duncan's Multiple Range test, those fields which had column 
coordinates less than 240 on ttie 15 m SAR data and 120 on the 30 m SAR data 
were grouped into one set of "spectral" classes and those fields whose 
coordinates were greater were grouped into a second set of "spjectral" classes. 
Table 5.4 shows, quantitatively, the differences in means and variances for the 
various cover t/p)es due to these look angle effects. Table 5.5 lists the 
number of "sp>ectral" classes associated with each cover class (combined for 



HH Polarization 
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Strip Number 


HV Polarization 


Strip Number 


Plotted means and standard deviaticais for each strip for both the 
HH and HV polar izaticais using the 15 m SAR data set. 
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Table 5,4. Means and Standard Deviati<»is for each cover class for both the 
left and right portions (i.e,, spectral classes) of the 1980 SAR 
data sets. 





15 

m 



30 m 





HH 


HV 


HH 


HV 

CXagg 


Left 

. .. .Right 

Lgft. 

—Right 

Left 

— Right 


Right 

SOIL 

X 

6.4 

13.8 

6.8 

16.6 

6.7 

13.4 

6.7 

17.0 


s 

2.7 

6.2 

3.3 

10.7 

1.9 

4.4 

1.7 

8.7 

CHOP 

X 

22.1 

14.6 

26.9 

18.1 

21.9 

15.4 

26.3 

19*0 


s 

11.4 

8.4 

17.3 

14.4 

7.6 

6.6 

12.3 

11.1 

HDND 

X 

42.4 

40.7 

44.0 

52.5 

43.4 

41.1 

44.0 

53.2 


s 

21.7 

21.6 

32.6 

38.1 

15.0 

14.1 

21.5 

25.1 

RGHD 

X 

33.4 

34.9 

37.2 

56.3 

33.4 

34.4 

36.9 

56.3 


s 

16.6 

16.6 

22.9 

33.2 

11.0 

11.0 

14.5 

19.5 

PINE 

X 

10.4 

14.4 

19.4 

39.1 

10.8 

14.6 

20,0 

39.2 


s 

5.3 

6.9 

11.8 

23.1 

4.2 

4.7 

8.2 

14.1 

PAST* 

X 


13.4 


42.2 


14.0 


43.1 


s 


6.8 


24.6 


5.5 


16.7 

WATR 

X 

3.8 

4.3 

6.2 

6.9 

4.6 

5.0 

6.4 

7.5 


s 

1.3 

2.0 

3.8 

2.4 

2.6 

3.0 

3.0 

1.9 


*The pasture class only had representative fields on the right portion of the 
flight swath. 


Table 5.5, Tlie number of spectra classes, txaining pixels, and test pixels associ 
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both sides o£ the flight line)^ and the numbers of pixels involved in the 
training and test da.ta of both the SAR and MSS data sets. 

Ihe classification of the SAR data was, of course, limited to the two 
channels of data available (i.e., the two polarizations). Three different 
classification algorithms were tested — the GML (Gaussian Maximum Likelihood) 
classifier, the Per**Field classifiei', and the SBCHO (Supervised Extraction and 
Classification of Hanogeneous Objects) classifier. Die latter two are both 
contextual classifiers, in that they base the classificaticxi decisicxi on both 
the mean and the variance of the spectral response over an area (a training or 
test field defined by the analyst in the case of the Per-Field classifier, or 
the "Homogeneous Object" defined by the algorithm in the SEGHO classifier) . In 
addition, both the 15 m and the 30 m SAR data sets were classified in order to 
evaluate the effect of spatial resolution on the SAR data. The 30 m SAR 
results were then compared to 30 m IMS data results in order to evaluate the 
effectiveness of the SAR data as compared to the IMS data. 

The SAR 15 m data was classified using each of the three classification 
algorithms, and the results are givai in Table 5.6 below. Figure 5.6 
graphically depicts the overall classification results for the three 
classifiers. Die overall differences between tlie three classifiers were 
significantly different, and, as shown in Figure 5.5, the classifiers that use 
spatial as well as spectral information (i.e., the PER-FIELD and SECHO 
classifiers), increased the overall classification performance l;iy a factor of 
almost two as compared to the GML per-point classifier. However, the overall 
performance for all three classifiers was rather low. On a class by class 
basis, the results are rather mixed. Hardwood, regenerating hardwood 
(previously clear cut areas) , crop and soil have much higher performances for 
both the PER-FIELD and SBCHO classifiers than were obtained using the OIL 
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l%ible 5.6. Test field classification results 
for the SAR 15 m data.^ 




Classifier 


Cover Class 

GML 

PERrPIEU) 

SEXSO 

Fine 

45.7^ 

37.4® 

52.9^ 

Hardwood 

37.2® 

93.6^ 

99.4^ 

Regen. Hdwd. 

28.3® 

70.1^ 

57.9^ 

Pasture 

25.1*^ 

48.8^ 

16.0® 

Crop 

19.9® 

35.3*^ 

33.4° 

'Soil 

50.1® 

93.6^ 

94.1^ 

water 

83.9^ 

82 .6^^ 

58.0® 

Overall 

35.7® 

68.4^ 

64.3^ 


^Different superscripts indicate significantly 
differoit classification performances betweoi 
the classifiers, based on a Ne»Anan-Keuls 
confsarison with <x= 0.10. 


classifier, nowever, although the performance for hard«rood and soil was very 
high for both of the contextual classifiers, the performances for crop and 
pasture were low. The other cover types had mixed performances between the 
classifiers, and their performances were generally very lew. For pine and 
pasture, the poor performances were attributed to the fact they had very 
similar radar returns and the classificaticxi algorithms could not discriminate 
between these two classes.^ This similarity can be seen in Figure 5.7, which 
shows the mean ± one standard deviation of the radar return for each of 


^j^ipendix D contains performance tables showing caimissicai and omissicxi errors 
between cover types for all data sets and classifier combinations discussed 
in Section V of this report. 



Overall Classification Performance 
(% Correct) 
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Figure 5. 
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fiers using the 15 m SAR data. 
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"spectral" training classes defined. Many "spectral" classes within a single 
cover type, as well as different cover types clearly had very similar radar 
returns in both polarizations. 

The water class had fairly high classification performances for both the 
GML and PER-FIEaj) classifiers, but a much Icwer performance for the SEX3K) 
classifier. This poor performance the SEICHO classifier was due to the 
algorithm, and more specifically, the "moving window" portion of the classifi- 
cation process. Since the majority of the water class was comprised of the 
Wateree River and the river is approximately 70 m in width, then the width of 
the river was represented only six pixels for the SAR 15 m data set. The 
moving window was three pixels wide and thus, many times included boundary 
pixels. The resulting radar return recorded within the 3 pixel x 3 pixel 
window could be similar to that of other classes, resulting in misclassifica- 
ticns. 

After the 15 m SAR data had been classified, the same three algorithms 
were used with the 30 m SAR data, i^roximately the same areas in both data 
sets were used for training and test fields. The test field performarices of 
the SAR 30 m data for the three classifiers are shown in Table 5,7 and in 
Figure 5.8. These results show that both the SBCHO and PER-FIELD classifiers 
performed significantly better than the GML classifier. All three overall 
classification performances were found to be significantly different from each 
other. As seen in Table 5.5, the hardwood cover class had a very high 
performance for both the PER-PIEIjD and SEJCHO classifiers, and the hardwood, 
regenerating hardwood, pasture, and crop classes all had much higher classifi- 
caticai accuracies for both the PER-FIEIJ) and SEICHO classifiers than the OIL 
classifier. However, the pine cover class had a very low classificaticxi 
performance for the SBCHO classifier. This was attributed to the large number 
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Table 5.7, Test field classification results 
for the SAR 30 m data,**^ 


■Claggifjgr. 


Cover Class 

GML 

PER-FIELD 

SECHO 

Pine 

65.5^ 

90.4^ 

53.8® 

Hardwood 

52.6^ 

93. 3^^ 

97.9^ 

Regen. Hdwd. 

45.0® 

66.1*^ 

63. 6*^ 

Pasture 

19.7® 

41.9^ 

43.6*^ 

Crop 

85.8® 

34,6*^ 

50.9*^ 

Soil 

71.0^ 

46.4® 

65.0^ 

water 

62.7^ 

70.8*^ 

39,8® 

Overall 

45.9® 

63.3*^ 

65. 8*^ 

■^Different superscripts 

indicate significantly 


different classification performances betweai 
the classifiers r based on a Newroan-Keuls 
coirparison with a= 0,10, 


of pine test pixels that were classified as pasture (see i^^^jendix D) . All 
three classifiers performed poorly in discriminating pasture and pine from each 
other, with the CML classifier having a particularly low accuracy for pasture. 
Because the radar returns for the soil and water classes were very similar (as 
shewn in Figure 5.9), there was considerable confusion betweai these two 
classes. The low PCXJ performance for the water class using the SEX3HO 
classifier Was again due to the "window size" utilized in the SECHO classifier, 
as well as the spatial resoluticai of the pixels. In coirparing Figures 5.9 and 
5.7, it is clear that the degradation of the spatial resolution to 30 meters 
caused a distinct decrease in the veuriance of the radar returns for most of the 
"spectral" classes involved in these classifications, which should cause a 
higher classification performance for the 30 m data when using the GML 
algorithm. 


Overall Classification Performance 
C% Correct!) 
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5.8. Overcdl test field classification performances for three classi- 
fiers using the 30 m SAR data. 
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Figure 5.9. Coincident spectral plot (means plus and minus one standard deviation) for all cover 
type "spectral" classes for the 30 m SAR data. 
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The overall POC performances for the 15 ra SAR and 30 in SAR data using the 
three classifiers are con^aared in Figure 5,10, The results of the statistical 
evcduation between the data sets are given in Table 5,8. The overall classifi- 
cation performances betweei the two data sets were found to be significantly 
different for the GML and PEEl-FIEIJ) classifiers, but they were not signifi- 
cantly different for the SECHO classifier. 

For the GML classifier, these results show that overall performance tends 
to increase by degrading the spatial resolution, as anticipated. This is 
because the spectral variability associated with each cover class is reduced in 
the 30 m data, and the amount of overlap between the "spectral" distributions 
is therefore reduced, thus reducing the probability of raisclassification. 

The comparison of the two data sets for the PER-FIEID and SECHO classi- 
fiers show that the overall results are rather similar. With the performance of 
the 15 m SAR data set being slightly higher than the 30 m SAR data set when 
using the PER-FIELD classifier. These results would tend to indicate that by 

Table 5,8, Statistical ccm^rison between the 
overall classifications of the 15 m 
and 30 m SAR data sets, for each 
classification algorithm.'^ 


Data Set 


Claggjfier 


fiftR 30 m 

GML 

35.7^ 

45.9^ 

PER-FIEID 

68,4^ 

63.3® 

SECHO 

64.3® 

S5.8® 


^Different superscripts indicate significantly 
different classification performances betweai 
the data sets, based on a Newman-Keuls 
con^arison with a = 0.10, 
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degrading the spatial resolution, overall classification performances may not 
increase when using contextual classifiers. However, because both contextual 
classifiers performed much better than the GML classifier with either spatial 
resolution data set, it would suggest that algorithms that incorporate both 
spectral and spatial information in the classificaticxi decision will produce 
significantly increased classification performances when using SAR data* 

The classification performances by cover class for the three classifiers 
examined and for both the 15 m and SAR 30 m data sets are shown in Figure 5.11. 
The hardwood (HIM)) class has a high classification performance for both data 
sets using both the PER-FIELD and SHOD classifiers. Also, the crop and 
regenerating hardwood (RGHD) cover classes had higher performances using either 
of the textural classifiers than when the OIL classifier was used. Such 
results would be es^cted, since hardwood, regenerating hardwood, and crop 
cover classes all had relatively large "spectral" variances in the SAR data (as 
shown in Figure 5.7 and 5.9), and both the PER-FIEID and SBCHO classifiers can 
incorporate this information along with the spectral information to better 
separate the "spectral" distributions. However, the classification 
performances of the regenerating hardwood and crop classes were relatively low 
for all three classifiers due to misclassification with other vegetation 
classes having similar "spectral" distributions. . 

The cover classes pine, pasture, soil, and water had irregular patterns of 
classification performances. As previously maitioned, pine and pasture had 
similar levels and distributions of radar return, in spite of the significant 
physical differences between these tyo cover types. The similar radar data 
values caused considerable confusion and misclassification between these two 
cover types for all three classifiers. 



Classification Performance Classification Performance 

(% Correct) (% Correct) 
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Figure 5.11. Classification performances by cover class for the three 
classifiers, and for both the 16 m and 30 m SAR data sets 
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For the 30 m data, soil had a somewhat higher classification performance 
with the GML classifier than with either contextual classifier. However, when 
using the 15 m data, soil had a much higher classification performance for the 
PEK-FIBLD and SBCHO classifiers than the GML classifier. The pixel-to-pixel 
variation in the 15 m data set was a^^rently very useful in helping the 
contextual classifiers to idaitify bare soil correctly. By degrading the 
resolution, the amount of pixel-to-pixel variatiai was reduced within each 30 m 
pixel in the fields of bare soil. 

The 30 m IMS data covered approximately the same area as both the 15 m and 
30 m SAR data sets. The training and test fields were generated using 
procedures that were similar to those used for the S/IR data and representing 
the same cover types. However, in some cases, the field locaticxis for a 
particular cover type were not the same between the SAR and IMS data sets due 
to changes in the cover type (e.g. , bare soil to crops) as a result of 
differences in data collection dates (i.e, , SAR = June 30 versus TMS = August 
29, 1980). Eight channels were available for classification; however, only the 
best three channel conhination was used in the classification. Channels 3, 5, 
and 8 (0.63-0.69 ym, 1.00-1.30 ym, and 10.4-12.8 ym, respectively) were 
identified as the best three channel catibination using divergence as the 
separability measure between all possible ccmibinations for the given spectral 
classes. 

The overall and cover type classification performances for the three 
classifiers using the 30 m IMS data is given in Table 5.9. For all three 
classifiers, the overall classification performances were greater than 90 
percent and were found to be significantly different. These results indicate 
that for a limited area and for the given cover classes, a reasonable classifi- 
cation of the test could be performed using only three channels of TMS data. 
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Table 5.9. Overall and cover class classification 
test performances for each classifier, 
usii^ the 1980 30 m data (supervised 
tredning statistics) 




Classifier 


Cover Class 

GML 

PER-FIEID 

SBCHD 

Pine 

75.4^ 

73.9® 

75.4^ 

Hardwood 

91.2® 

100.0® 

96. 9*^ 

Regen# Hdwd. 

86.7® 

89.6® 

89.1® 

Pasture 

87.1® 

94.1*^ 

91.5®^ 

Crop 

95.3® 

100.0*^ 

95.1® 

Soil 

99.3® 

100. 0*^ 

97.6® 

Water 

94.8® 

93.8® 

99.5® 

Overall 

91.1%® 

96.5%® 

94.3%® 


^Differ&it superscripts' indicate significantly different 
classification performances between the classifiers, 
based on a Newman-Keuls comparison with a = 0.10. 


In addition, the overall classification performances for both the PER-FIELD and 
SEICHO algorithms were significantly higher than the Q5L performance. This 
again enphasizes the point that by using additional information (i.e, , 
texture) , classification performances can be improved. 

The overall classification performances for the 15 m SAR, 30 m SAR, and 
30 m IMS data sets using the three classifiers are givei in Figure 5.12. The 
statistical comparisons, by cover type and for the overall classification 
performances are given in Table 5„10. For all classifiers, the 30 m TMS data 
set performrad significantly better than either the 15 m or 30 m SAR data sets. 
This was found both for the individual cover types and for the overall 
classificatich conparisons. However, in evaluating these results oonparing SAR 
and TMS data, one must keep in mind that the classification of the SMI data 
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Figure 5.12. Overall test classification performance for the 15 m SAR 
30 m SAR, and 30 m IMS data for the three classifiers. 
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involved only two channels of data of a single wavelength. If SAR data from 
one or two additicml wavelengths were avai.lable, it is conceivable that the 
SAR data could provide results as good as or better than those obtained with 
the *018 data. 

Ohe najor results for the quantitative analysis of the SAR data can be 
sumnarized as follows: 


1. The HH and HV polarized data sets had independent geometric distor- 
tions which required special preprocessing techniques to successfully 
digitally overlay the two sets of data, 

2. Significant improvements in overall classification i^rformances were 
achieved using both the PER-FIELD and SBCHO classifiers Versus the GML 
per-p7int classifier for both the SAR and IMS data sets, 

3. Pine and hardwood cover classes could be reliably differ^tiated on 
the SAR (as well as the IMS) data. 

4. Pine and pasture cover classes, and fe^sre soil and water cover classes 
were consistently confused with each other on this X-band SAR data. 

5. There were statistically significant differences in radar return 
across the flight-line due to look-angle effects for many cover types, 
particularly in the EIV polarized data. 

6 * Degrading the spatial resolution of the SAR data (from 15 m to 30 m) 
caused the overall percent classification performance for the GML 
per-point classifier to increase due to the better separation of the 
probability density functions associated with some of the cover types. 
However, degrading the spatial resolution, had either no effect or a 
negative effect on the overall classification performance of the 
contextual classifiers (i.e. , PFE-FIELD or SEX3K)) . 

, The various threshold parameters (i.e., window size, homogaieitYf and 
annexation) used in the SBCHO classifier are data dependent and are 
strongly influenced the size, shape, and textural characteristics 
of the cover types being classified. 


7 
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A. Sunmary 

tXiring the course of this investigation, the qualitative and quantitative 
analysis of both the Thematic Mapper Simulator (IMS) and the Synthetic Aperture 
Radar (SAR) data produced a nunher of results and conclusions, which can be 
sumnarized as follows; 


Spatial Resolution stuOy 

1. Ihe use of successively higher spatial resolution date resulted in 
lower overall classification accuracies when classifications were 
ccxifducted with a "per-point" GML classifier. 

2. Higher classification accuracies were achieved with the "per-point" 
classifier when using 60 x 75 meter (as exposed to finer) spatial 
resolutiCNi data in cover classes associated with relatively high 
levels of spectral variability across adjacent pixels (i.e., old-age 
hardwood, second growth hardwood, pine forest, and clearcut areas) , 

3. Differences in classification accuracies achieved with data of 
differoit spatial resolutions were not significant (a = 0,10) for 
cover classes associated with relatively low levels of spectral 
variability across adjacait pixels (i.e., pasture, crops, bare soil, 
or marsh vegetation) , 


wavet?flnd,.EvaIvfttipn £tedy 

1. use of four wavelength hands produced considerably better classifica- 
tioi results than when only two or three wavelength bands were 
utilized. 

2. Maximum overall classification performances were obtained when all 
wavelength bands v/ere utilized. 

3. The increase in overall classification performance when more than four 
wavelength bands were utilized was minimal, therefore indicating that 
an appre^riate set of four wavelength bands provides the best 
confcination of relatively high classification accuracy and minimal 
computer time. 

4. Classifications using the 1979 data set and various three and four 
wavelength band combinations indicated the importance of both the 
visible and near- infrared portions of the spectrum for accurately 
classifying various forest and other cover types, 

5. These results, which were primarily focused on differentiation of 
various types of healthy vegetative cover, did not indicate any 
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particular advantage for using wavelength bands in portions of the 
spectrum beyond those to which Silicon detectors (used in Multi-Linear 
Array ^stems) are sensitive. 

6. The Si^rvised method of developing training statistics provided 
slightly better overall classification results than the Multi-Cluster 
Blocks technique for both the 1979 and 1980 data sets. It would 
aj^ar that for situations where accurate, reliable reference data 
(i.e. , "ground truth") is available over the entire study area and for 
data having fine spatial resolution, the Supervised technique is 
generally best. It is particularly useful for waveband evaluation 
studies involving different cover t^i)es. 

7. Overall classifioition accuracies bsised on the "best 3" wav^aands 
defined ^ the average transformed divergence values were signifi- 
cantly higher than those based on the "best 3" wavdjands defined by 
tJie minimum transformed divergence values. 


■Canpar.i^iSCn Anong Classification Algorithms 

1. The SExmo algorithm consistently resulted in higher overall classifi- 
cation performances than were cA)tained with the GML algorithm, 
regardless of the data set or training statistics being utilized. 

2. The L-2 Minimum Distance algorithm produced significantly less 
accurate classif ications than were obtained using ei^er the GML or 
the SBCHO algorithms. 

3. Overall classification performances of 85-90%, based on test data 
sets, were obtained for both the 1979 and 1980 TOS data when four or 
more wavelength bands were utilized in conjunction with the SEJCHO 
classifier and either the Supervised or Multi-Cluster Blocks training 
statistics . 

4. Phenological effects caused distinct differences in spectral response 
for some cover types, especially tupelo, when comparing the 1979 and 
1980 data. 



^oniponents o r. Karhunen-Loeve (K-D Transformation of \he WS Data 


1. The K-L transformation (with 4 ccaiponents) significantly decreased the 
overall classification performance for both the GML and SEX3K) 
classifiers, but the overall classification for the L-2 classifier was 
generally increased, 

2. For individual cover types, the GML and SBCHO performances tended to 
be rather similar (both would either increase or decrease by a similar 
amount for a particular cover class with a K-L transformation) but the 
L2 classifier tended to react in the opposite way; i.e,, when the GML 
and SECHO classification cover class performances decreased with a K-L 
transformation, the L2 increased, and vice-versa (with the exception 
of the OCUT and WATER categories) . 
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3. A K-L transfonnation and the L-2 classifier improved all cover class 
performances when using only three channels (i.e,, con^x>nents) and 
most cover class performances when using four channels, 

4. A K-L transformation and the GML classifier proved some (i.e,, half) 
of the classification performances for the individual cover classes 
when using three channels (components) , but when using four channels 
the classification performances were often considerably better with 
untransformed data. 

5. In geieral, it ag^ars that for classifications using fewer number of 
channels (features) than is c^Jtimum for a particular data set (i.e., 
the intrinsic dimensionality of the data, which in this case is four) , 
a K-L transformation will improve overall and most c»ver class 
performances. However, if the number of channels used is equal to the 
intrinsic dim^sionality of the data, the original untransformed data 
appears to provide better class separability and subsequent 
classification performance. 


Qualitative Anal^/sis of tlie SftR lata ; 

1* Deciduous forest cover is easily identified on the HH image die to a 
distinctive light tone, whereas on the HV image deciduous forest cover 
has a darker tone. 

2. Coniferous forest cover is rather dark in tone cai both the HH and HV 
polarizatic»i imagery. Therefors, deciduous and coniferous forest 
cover are easily separated on the HH image due to their distinctive 
tonal differaices, but are difficult to separate on the HV image. 

3. Dense deciduous forest stands located in ravines are easily identified 
CXI both polarizations because tiie topographical pattern is highlighted 
by the response of the deciduous forest cover and also highlighted by 
the slopes which serve as angular reflectors. These patterns are more 
distinctive on the HH image than on the HV image. 

4. Regenerating hardwood stands and fields having emergent vegetation 
tend to lcx)k very similar in both tone and texture cxi both polariza- 
tions. 

5. Pine stands and pastures are both rather dark in tone in both the Iffl 
and HV polarizaticais and are therefore very difficult to differ^tiate 
on this X-band SAR data, in spite of the distinct differoices in the 
physical characteristics of these caver types. 

6. Water and smooth bare soil features have a distinctive black appear- 
ance txi both polarizaticais due to the specular reflectance of the 
anitted radar signal away from the antenna. 

7. Tupelo stands could not be distinguished from the surrounding hardwcxid 
forest on either the HH or HV imagery. 

8 . Differences in stand doisity and size class of forest stands could not 
be defined on either the HH or the H\^ polarization of the SAR data. 
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9. In the data set used in this stu<ty, there was a tonal variation 
related to range angle <»i the HV image and a distinctive banding 
effect CXI the HH image which indicted the ability of the interpreter 
to reliably identify various cover types throughout the entire data 
set. Hiese effects were also evidait cxi data sets for other flight 
lines. 



1. The HH and HV polarized data sets had independent geometric distor- 
ticxis which required special preprocessing techniques to successfully 
digitally oi^erlay the two sets of data. 

2. There were statistically significant differences in radar return 
across the flight-line due to look-angle effects for many cover types, 
particularly in the HIV polarized data. 

3 . Since early one wavelength (X-Band) , represented by two channels (HH 
and HV polarizations) of SAR data were available for analysis, overall 
classification performances of caily about 65% were obtained with the 
SAR data. It is believed that additional wavelengths of SAR data 
would enable significantly higher classificaticxi performances to be 
achieved, 

4. Significant improvements in overall classification performances were 
achieved using both the PER-FIELD and SEXIHO contextual classifiers 
versus the GML per-point classifier for both the SAR and TMS data 
sets. 


5. Pine and hardwood cover classes could be reliably differentiated on 
both the SAR and OMS data, 

6. Pine and pasture cover classes, and bare soil and water cover classes 
were consistently confused with each other on this X-band SAR data, 

7. Degrading the spatial resolution of the SAR data (from 15 m to 30 m) 
caused the overall percent classification performance for the GML 
per-point classifier to irorease due to the better separation of the 
probability density functions associated with some of the cover types. 
However, degrading the spatial resolution had either no effect or a 
negative effect on the overall classification performance of the 
contextual classifiers (i.e. , PER-FIE5ID or SE)CHC} . 

8. The various threshold parameters (i.e., window size, homogeneity, and 
annexation) used in the SBCHO classifier are data dependent and are 
strongly influenced by the size, shape, and textural characteristics 
of the cover types being classified. 


In conclusion, although Thematic Maj^r data will undoubtedly be better 
than the current Landsat data from a mensurational stanc^int, these 
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preliminary results — which showed a decreased classification performance with 
higher (e.g. , smaller) spatial resolutiai — tend to indicate that oonventioial 
per-point classification techniques may not be effective when using higher 
resolution data, particularly for areas involving classification of forest 
cover. Thus, classification techniques such as SBCHO (which utilizes the 
spaticd variability in additic»i to the mean spectral response of an entire 
forest stand or agricultural field) , need to be further tested and refined for 
use with Thematic Mapper data. 

The results of this investigation indicated that the Sijpervised technique 
for developing training statistics and the Sample Block Test Data approach for 
defining a statistically valid set of test data were effective, and that the 
average Transformed Divergence — based on the "best" four wavelength bands — 
defined by the Feature Selecticn processor in LAEISYS enabled an optimum sub-set 
of wavebands to be defined. Use of fewer than four wavelength bands resulted 
in significantly lower classification performances, while more than four 
wavelength bands did not cause significant inprovements in overall classifica- 
tion accuracy^. Likewise, a Principal Components transformation did not prove 
useful for increasing classification performance when either the SBCHO or GML 
classification algorithm were utilized with four channels of data. Conparison 
amc«g different classification algorithms indicated that the SBCHO contextual 
classifier provided the best overall classification results. 

The SAR data could be used to separate some cover types with a high degree 
of reliability, but other cover types could not be adequately separated, ev^ 
though they were physically very different. The value of multiple frequencies 
(particularly the longer wavelengths) as well as multiple polarizations of SfiR 
data must be assessed in order to develop a better understanding of the true 
capabilities and limitations of SAR data for mapping forest cover types and 
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their characteristics. However, such studies should be conducted using 
digitally — rather than optically — processed SAR data. 


B. Recomnendations 

1. Contextual classifiers (e.g. , SBCHO) , imist be more fully developed and 
evaluated in order to assess the importance of such classifiers for 
□ffectively analyzing higher spatial resolution data such as that 
obtained by the Thematic Ma^^r. 

2, Additional evaluations of Principal Component Transformations should 
be conducted with Thematic Mapper data in order to better assess the 
potential advantages and limitations of such data processing 
techniques in operational situations. 

3. An effective and legitimate methodology for combining errors of 
cotmission and errors of emission is needed in order to provide a more 
meaningful measure of overall classification performance. In 
addition, a statistically valid but economically feasible methodology 
for defining test data sets (such as the "Sample Block Test Data" 
method developed in this study) needs to be tested and standardized 
for use by different researchers using coinxiter-aided analysis 
techniques. 

4, Digitally processed SAR data of multiple wavelengths and polarizations 
should be analyzed to better understand the capabilities and limita- 
tions of the microwave portion of the spectrum for mapping forest 
cover types and characteristics. 
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APPENDIX A (Tables 1-36) 


1979 Waveband E^^aluation Classification Results and 
Statistical Analysis Tables 
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Table 1, Sunmary table of overall classification results^ table location and channel subsets 
of the 1979 Waveband Evaluation: GML algorithm, sample block test data. 


1 2 3 4 5 6 7 

WAVEBAND 0.45- 0.52- 0.63- 0.76- 1.00- 1.55- 10.4- 


(»L 

Training Statistics 


COMBINATION 0.52 0.60 0.69 0.90 1.30 1.75 12.5 Sifiervised 


"Best 2" 


"Best 3" 


"Best 4" 


"Best 5" 


"Best 6" 


Visible 
Reflective IR 


"Best 3 minus 
ISiermBl IR" 


"Best 3 minus 
Middle IR" 


"Best 3 minus 
I'iear IR" 


"Best 3 minus / 
Reflective IR" \. 

Simulated 

Landsat ^ 

Pour channel y 
subsets with one ^ 
channel from uch 
wavelength region 


80.5%(Table 2 )^ 81.5%(Table 15) 


78.4% (Table 3) 


76.0% (Table 16)1 


86.1% (Table 4) 


86.1% (Table 17) 


88.3% (Table 5) 


87 .6% (Table 18) 


89. 9% (Table €) 


87. 4% (Table 19) 
90 .7% (Table 7) 88.7% (Table 20) 
B1.0%'(Table B) 72.2% (Table 21) 
71. 9% (Table 9) 64.6% (Table 22) 
78.4% (Table 3) 


76.0% (Table 16) 


85. 4% (Table 10) 


76.0%(Table 16) 


78.4% (Table 3) 


82.1% (Table 23) 


81.0% (Table 8) 


64 .3% (Table 24) 
88.9%(Table 12) 87.8%(Table 26) 


83.4% (Table 13) 85 .3% (Table 27) 
87.0% (Table 14) 86.4% (Table 28) 


^Table numbers refer to the classification performance tables in T^^endix A of this report. 


Table 2, Waveband Evaluation Classification Results Using Channels 2 & 5 (the best 2) . (1979 Ol-lS 

Data, Supervised Training Statistics, GML Classifier) 
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AVERAGE PERTORMANCE BY COVER CLASS = 532.4/8 = 66.6% 



Table 3. Waveband Evaluation ClassificaticHn Results Using Channels 1, 3, & 6 (the best 3). (1979 

IMS Data, Supervised Training Statistics, GML Classifier) 
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T&ble 4. Waveband Evaluation Classification Resulte Using Channels 2, 4, 5, & 7 (the best 4). 
(1979 IMS Data, Supervised Training Statistics, GML Classifier) 
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Table 5. Wavdaand Evaluation Classification Results Using Channels 2 , 3 , 4 , & 7 (the best 5) 

(1979 IMS Data, Siqpervised Training Statistics, GHL Classifier) 





i 


Results Using Channels 1, 2, 4, 5, 6, & 7 (the best 6). 
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T&ble 7 . Waveband Evaluation Classification Results Using All 7 Channels, (1979 IMS Datar 
Supervised Training Statistics, GHL Classifier) 
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AVERAGE PERPOIWfiNCE BY CDVER CLASS = 543/8 = 67,9% 


T^le 9. wavdaand Evaluatioi Classification Results Using Reflectiye IR Channels (4 
(1979 TMS Data j. Supervised Training Statistics, GML Classifier) 
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OVERALL PERFORMANCE = 9382/10,557 = 88.9% 

AVERAGE PERFORMANCE BY COVER CLASS = 640.6/8 * 80.1% 



TtedDle 13. Waveband Evaluation Classification Results Using Channels 3, 5^ 6, & 7 (one d 
from each wavelength region) . (1979 IMS Data, Supervised Training Statistics 

Classifier) 
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Table 14. Waveband Evaluation Classification Results Using Channels 2r 4r 6, & 7 (one cliannel 
fron each wavelength region) , (1979 IMS Data, Supervised Training Statistics, (Mi 

Classifier) 
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AVERW3E PERPOFMANCE BY COVER CLASS = 599.8/8 = 75.0% 



l^le 15, Wavd^and Evaluation Classification Results Osirtg Channels 2 & 5 (the best 2) . (1979 IMS 

Data, MCB Training Statistics, GML Classifier) 
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Table 16. Waveband Evaluation Classification Results Using Channels 1, 3, s 5 (the best 3) . 
(1979 IMS Data, MCE Training Statistics, GML Classifier) 
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T^le 17. Vlavebaxid Evaluaticn Classification Results Using Channels 1, 3, 4, & 6 (the best 4) 
(1979 IMS Data, M(B Training Statistics, GML Classifier) 
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Table 18. Waveband Evaluation Classification Results Using Qiannels 1, 2, 3, 4, & 6 (the best 5) . 
(1979 TMS Data, MCE Training Statistics, QiL Classifier) 
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AVER/^E PERPOFMfiNCE BY COVER CLASS = 641,7/8 = 80.2% 
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T&ble 20. Waveband Evaluation Classification Results Using Ml 7 Channels. (1979 IMS Data 



AVERAGE PERFORMANCE BY COVER CLASS = 651.5/8 = 81.4% 












I^le 22. Wavdaand Evaluation Classification Resiilts Using Reflective IR Channels (4 
(1979 IMS Data, MCB Training Statistics, GML Classifier) 
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AVERAGE PEREDRMRtCE BY OCfVER CLASS = 570.3/8 = 71.3% 



T^le 23* Wavdaand Evaluation Classification Results Using the Best 3 Channels Minus tte Near 
Channels 12, 3, & 6) . (1979 IMS Data, MCB Training Statistics, QHL Classifier) 
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AVERflGE PERFORMANCE BY COVER CLASS = 502.5/8 = 62.8% 
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Table 26. Waveband Evaluation Classification Results Using Channels 2,3,4, &5 (sinulated 
Landsat) . (1979 TMS Data, MCB Training Statistics, GML Classifier) 
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Table 27. Waveband Evaluation Classification Results Using Channels 3, 5, 6, & 7 (one channel 

from each wavelength region) . (1979 IMS Data, MCE Training Statistics, OIL Classifier) 
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AVERASE PERPOEMflNCE BY COVER CLASS = 635.5/8 = 79.4% 



Table 28. Waveband Evaluation Q^sification Results Using Channels 2, A, 6, & 7 (one channel 

from each wavelength region) . (1979 TMS Data, MCB Training Statistics, GML Classifier) 
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Table 29, Statistical con^jarison among overall classification results for 
the GML algorithm using various three channel subsets and based 
upon the 1979 supervised training statistics and sample block 
test data. 



Channel^ 

Table 


No. of 

Significantly 

Differences^ 


Subset and 

Location 

% Correct 

Samples 


(lr3,6) 

(Table 3) 

78.4 




Ur2,3) 

(Table 8) 

81.0 



Overall 

Classification 

(4,5,6) 

(Table 9) 

71.9 

10,557 

All 

Performance 

(3,4,5) 

(Table 10) 

85.4 




(2,4,5) 

(Table 11) 

86.9 






Channel combinations which are significantly different are indicated based 
upon a N®flnan-Keuls coii^iarison with « = 0.10, 


2 /, 


Description of the three channel subsets: 

(lf3,6) = the "best 3" channel subset as determined by 'll) (MIN) . 

(©best 3 - Thermal IR) 

(©best 3 - Near IR) 

(1,2,3) = visible channels, and "Best 3" minus Reflective IR 

(4,5,6) = reflective IR channels 

(3.4.5) = "Best 3" channels minus Middle IR channels 

(2.4.5) = the "best 3" channel subset as determined by TD(AVE) , 



179 


Table 30, Statistical comparison among classification results by cover 

class for the GML algorithm using various three channel subsets 
and based upon the 1979 supervised training statistics and 
sample block test data. 



Cover 

Channel 

Table 


No. of 

Significant, / . 


Class 

Subset and 

Location 

% Correct 

Samples 

Differences^ 



(1,3,6) 

(Table 3) 

94.7 


(3,4,5)/All 



(1,2,3) 

(Table 8) 

92.1 


(4,5,6)/(l,3,6) 


PINE 

(4,5,6) 

(Table 9) 

91.2 

775 

(1,2,3)/(1,3,6) 



(3,4,5) 

(Table 10) 

87.1 





(2,4,5) 

(Table 11) 

91.2 



V 


(1,3,6) 


77.8 



1 


(1,2,3) 

Same 

84.6 





(4,5,6) 

as 

69.5 

7269 

All 

> 


(3,4,5) 

above 

88.6 



1 


(2,4,5) 


91.7 



1 

i 


(1,3,6) 


21.2 


(l,3,6)/(2,4,5); (3,4,5); 

1 


(1,2,3) 

Same 

66.1 


(1,2,3) 


lUPE 

(4,5,6) 

as 

30.5 

118 

(4,5,6)/(3,4,5) & (1,2,3) 



U,4,0) 

above 

58.5 


(2,4,5)/(3,4,5) & (1,2,3) 



(2,4,5) 


34.7 





(1,3,6) 


68.1 


(3,4,5)/AU 



(1,2,3) 

Same 

47.6 


(2,4,5)/(l,3,6) 


CCOT 

(4,5 ,b) 

as 

47.3 

370 

(4,5,6)/(l,3,6) 

i 


(3,4,5) 

above 

36.5 


(1,2,3)/(1,3,6) 

1 


(2,4,5) 


42.7 





(1,3,6) 


62.3 


(1,2,3)/A11 

I 

! 


(1,2,3) 

Same 

38.0 


(l,3,6)/(4,5,6) & (3,4,5) 

1 

PAST 

(4,5,6) 

as 

71.7 

350 

(2,4,5)/(4,5,6) & (3,4,5) 

I 


(3,4,5) 

above 

76.0 



1 


(2,4,5) 


64.0 



1 

1 


(1,3,6) 


61.5 


(l,3,6)/(4,5,6); (2,4,5); 



(1,2,3) 

Same 

65.0 


(3,4,5) 

i 

CROP 

(4,5,6) 

as 

69.6 

369 

(l,2,3)/(3,4,5) 

' 1 


(3,4,5) 

above 

74.3 





(2,4,5) 


71.3 



i 


(1,3,6) 


89.8 


(2,4,5)/(3,4,5) & (1,3,6) 

1 


(1,2,3) 

Same 

86.3 


(4,5,6)/(3,4,5) & (1,3,6) 


SOIL 

(4,5, b) 

as 

85.7 

1006 

(l,2,3)/(3,4,5) & (1,3,6) 

K 


(3,4,5) 

above 

89.2 





(2,4,5) 


85.4 



1 

1 


(1,3,6) 


88.0 


(1,2,3)/A11 

1 


(1,2,3) 

Same 

63.3 
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WATER 

(4,5,6) 

as 

84.0 
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OF POOR QUALITY 

1 


(3,4,5) 

above 

87.7 



f 

1 / 

(2,4,5) 


85.0 



i 


upon a Newman="Keuls con^iarison with a = 0.10, 
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Table 31* Statistical tssniarison amcmg overall classification results for the GML 
algorithm asing various three channel subsets and based upon the 1979 MCB 
training statistics and sample bloc^ test data. 



Channel^ 

Table 


No. of 

Significant^ 

Differences^ 


Subset and 

Location 

% Correct 

Samples 


(1,3,5) 

(Table 16) 

76.0 


(1,2,7)/(1,2,3); (1,3,5); 






(2,3,6); (3,4,7) 


(1,2,3) 

(Table 21) 

72.2 


(4,5,6)/(l,2,3); (1,3,5); 

Overall 

Classification 

(4,5,6) 

(Table 22) 

64.6 

10,557 

(2,3,6); (3,4,7) 

Performance 

(2,3,6) 

(Table 23) 

82.1 

(1,2,3)/(1,3,5); (2,3,6) 


(1,2,7) 

(Table 24) 

64.3 




(3,4,7) 

(Table 25) 

84.4 




^Channel combinations which are significantly different are indicated based upon a 
Newman-Keuls con^arison with « = 0.10. 


2 / 

Description of the three channel subsets : 

(l,3r5) = the "best 3" channel subset as determined by TD(MIN); in addition 

to the "best 3" channel subsets minus the Thermal IR and Middle IR 
channels, respectively 

(1,2,3) = Visible channels 


(4.5.6) 

(2.3.6) 

(1.2.7) 

(3.4.7) 


Reflective IR channels 

"Best 3" channel subset minus the Near IR channels 
"Best 3" channel subset minus the Reflective IR channels 
the "best 3" channel subset as determined ly 7D(AVE) 
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arable 32. Statistical cxxnfariSon among classification results by cover class 
for the CML idgorithm using various three channel svds^ts and 
based upon the 1979 MC3B training statistics and sample block test 
data. 


Cover 

Channel 

Table 


No. Of 

Significant^ 

Differences^ 

Class 

Subset and 

Location 

% Correct 

Sanies 


(1,3,5) 

(Table 16) 

833 


(1,2,7)/A11 


(1,2,3) 

(Table 21) 

90.7 


(3,4,7)/m 

PINE 

(4,5,6) 

(Table 22) 

93.8 

775 

(1,3,5)/A11 

(2,3,6) 

(Table 23) 

89.8 

(2,3,6)/(4,5,6) 


(1,2,7) 

(Table 24) 

38.5 


(l,2,3)/(4,5,6) 


(3,4,7) 

(Table 25) 

64.1 




(1,3,5) 


76.1 




(1,2,3) 


74.9 



HDWD 

(4.5.6) 

(2.3.6) 

Sante 

as 

57.0 

85.1 

7269 

All 


(1,2,7) 

above 

63.5 




(3,4,7) 


87.9 




(1,3,5) 


48.3 


(l,3,5)/(3,4,7); (1,2,7); 


(1,2,3) 

(4,5,6) 


76.3 


(1,2,3) 

TUPE 

Same 

55.9 

118 

(4,5,6)/(l,2,7); (1,2,3) 

(2,3,6) 

as 

61.0 

(2,3,6)/(l,2,3) 


(1,2,7) 

above 

72.9 




(3,4,7) 


66.1 




(1,3,5) 


30.3 




(1,2,3) 


35.4 



COJT 

(4.5.6) 

(2.3.6) 

Same 

as 

34.6 

36 

370 

None 


(1,2,7) 

above 

35.4 




(3,4,7) 


35.7 




(1,3,5) 


44.0 


(l,2,3)/(4,5,6); (2,3,6); 


(1,2,3) 


40.0 


(1,2,7); (3,4,7) 

PAST 

(4.5.6) 

(2.3.6) 

Same 

as 

50.0 

57.1 

350 

(l,3,5)/(2,3,6); (1,2,7); 
(3,4,7) 


(1,2,7) 

above 

69.4 


(4,5,6)/(2,3,6) ; (1,2,7); 


(3,4,7) 


72.3 


(3,4,7) 






(2,3,6)/(l,2,7); (3,4,7) 


(1,3,5) 


90.0 


(1,2,3)/A11 


(1,2,3) 


53.4 


(1,2,7)/A11 

CROP 

(4,5,6) 

Same 

97.3 

369 

(2,3,6)/All 

(2,3,6) 

as 

’ 82.9 

(1,3,5)/A11 


(1,2,7) 

above 

68.8 




(3,4,7) 


98.4 
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Cover 

Channel 

Table 


No. of 

Significant 

Differences 

Class 

Subset and 

Location 

% Correct 

Sanies 


(lr3,5) 

(Table 16) 

92.0 


(2,3,6)/All 


(1»2,3) 

(Table 21) 

85.3 


(1,2,3)/A11 

SOIL 

(4,5,6) 

(Table 22) 

96.4 

1006 

(l,2,7)/(4,5,6) 

(2,3,6) 

(Table 23) 

79.2 

(l,3,5)/(4,5,6) 


(1»2,7) 

(Table 24) 

91.0 


(3,4,7)/(4,5,6) 


(3,4,7) 

(Table 25) 

92.9 



(1#3,5) 


86.0 


(1,2,3)/A11 


dr2,3) 


18.3 


(1,2,7)/A11 

WATER 

(4.5.6) 

(2.3.6) 

Same 

as 

85.3 

89.3 

300 


(1,2,7) 

above 

83.0 




(3,4,7) 


85.7 




^Channel csombinations that are si0ii£ican.tly different are indicated based 
upon a Newman-Keuls comparison with a = 0.10. 
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l^ble 33. Statistical canparison among overall classification results for 
the GML algorithm using various four channel subsets and based 
upon the 1979 supervised training statistics and sample block 
test data. 



2/ 

Channel^ 
Subset and 

Table 

Location 

% Correct 

No. of 
Samples 

Significant, i 
Differences'^ 


(2,4 ,5 ,7) 

(Table 4) 

88.1 



Overall 

(2,3 ,4 ,5) 

(Table 12) 

88.9 


All are 

Classification 




10,557 

significantly 

Performance 

(3,5 ,6 ,7) 

(Table 13) 

83.4 


different 


(2,4,6,7) 

(Table 14) 

87.0 




^Channel combinations that are significantly different are indicated 
based upon a Newman-Keuls comparison with a := 0.10. 

2 / 

^Description of the four channel subsets; 

(2. 4. 5. 7) = the "best 4" channel subset as determined by TO(MIN) 

(2,3,4 ^5) = Simulated Landsat channels 

(3. 5. 6. 7) _ Both are four channel subsets with one channel from 

(2.4.6 .7) “ each wavelength region 
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Table 34. Statisticed comparison aniong classification results by cover class for 
the GML algorithm using various four channel sidssets and based upon the 
1979 sup3rvised treiining statistiq^and sample block test data. 

'• *'l**‘»y J.;, 


Cover 

Channel 

Table 


No. of 

Significant^ 

Differences^ 


Class 

Subset and 

Location 

% Correct 

Samples 



(2, 4, 5, 7) 

(Table 4) 

91.0 




PINE 

(2,3, 4,5) 
(3,5,6,7) 

(Table 12) 
(Table 13) 

92.6 

89.5 

775 

> None 



(2,4,6,7) 

(Table 14) 

92.3 





(2,4,5,7) 

Same 

as 

above 

91.1 


(3,5,6,7)/All 


HDWD 

(2,3,4,5) 

(3.5.6 .7) 

(2.4.6.7) 

91.8 

85.7 

90.7 

7269 

(2,4,6,7)/(2,3,4,5) 



(2,4, 5,7) 

Same 

as 

above 

58.5 


(2,4,6,7)/(2,4,5,7) & 

(2,3,4,5) 

TOPE 

(2,3, 4,5) 

78.0 

118 

(3,5,6,7)/(2,4,5,7) & 

(2,3 ,4,5) 

(3.5.6.7) 

(2.4.6.7) 

46.6 

42.4 

(2,4,5,7)/(2,3,4,5) 



(2,4,5 ,7) 

Same 

as 

above 

60.5 


(2,3,4,5)/All 


CCUT 

(2,3,4,5) 

(3.5.6.7) 

(2.4.6 .7) 

51.4 

63.0 

58.6 

370 




(2,4,5,7) 

Same 

as 

above 

82.6 


(2,3,4,S)/(2,4,6,7) S 

(2,4,5,7) 

PAST 

(2,3,4,5) 

(3.5.6. 7) 

(2.4.6.7) 

71.1 

74.9 

82.3 

350 

(3,5,6,7)/(2,4,6,7) & 

(2,4,5, 7) 


(2, 4,5, 7) 

Same 

as 

above 

79.7 


(2,4,6,7)/(2,3,4,5) & 

(2, 4, 5,7) 

CROP 

(2,3,4 ,5) 

(3.5.6.7) 

(2. 4.6. 7) 

79.1 

73.7 

71.5 

369 

(3,5,6,7)/(2,3,4,5) 



(2,4,5,7) 

Same 

as 

above 

85.6 


(2,4,6,7)/All 


SOIL 

(2,3 ,4,5) 

90.3 

1006 

(3,5,6,7)/(2,3,4,5) 


(3.5.6.7) 

(2.4.6.7) 

84.2 

81.0 

(2,4,5,7)/(2,3,4,5) 


WATER 

(2.4. 5. 7) 
(2, 3, 4,5) 

(3. 5. 6. 7) 

(2.4.6.7) 

Scflne 

as 

above 

88.1 

88.9 

83.4 

87.0 

10,557 

All 



^Channel combinations that are significantly different are indicated based upon a 
Newman-Keuls oon^arison with a = 0.10. 
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Stable 35. Statistical con^sarison among overall classification results for the 
GKL algorithm using various four channel subsets and based tj^n the 
1979 MCB training statistics and sample block test data. 



Channel^ 

Table 


NO. of 

Significant, , 
Differoices" 


Subset and 

Location 

% Correct 

Sandies 


(lr3,4,6) 

(Table 17) 

86.1 


(3,5,6,7)/All 

(l,3,4,6)/(2,3,4,5) 

Overall 

Classification 

(2,3,4,5) 

(Table 26) 

87.8 

10,557 

(2,4,6,7)/(2,3,4,5) 

Performance 

(3,5,6, 7) 

(Table 27) 

85.3 




(2,4,6,7) 

(Table 28) 

86.4 




^Channel canbinations that are significantly differait are indicated based upon 
a N&raan-Keuls con^arison with a = 0.10. 

2 / 

"Description of the four channel subsets: 


(1.3. 4. 6) = the "best 4" channel subset as determined by TO (MIN) 

(2,3 ,4 ,5) = Simulated Landsat channels 

(3.5.6 .7) _ Both are four channel subsets with one channel from 

(2.4.6 .7) “ each wavelaigth region 
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T^le 36. S'catistical oonparison amcxig classification results by cover class for 
the GML algori^ using various four channel subsets and based u^n the 
1979 MCB training statistics and saniEde block test data. 


Cover 

Class 

Channel 
Subset and 

Table 

Location 

% Correct 

No. of 
Samples 

Significant, / 
Differeices^ 

PINE 

dr3,4,6) 
(2, 3, 4, 5) 

(3. 5. 6. 7) 

(2. 4. 6. 7) 

(Table 17) 
(Table 26) 
(Table 27) 
(Table 28) 

91.9 

94.1 

93.7 

92.3 

775 

None 

HDWD 

(lr3,4,6) 
(2,3 ,4,5) 

(3 .5 .6 .7) 

(2.4.6.7) 

Same 

as 

above 

88.4 
90.1 

86.4 
87.9 

7269 

(3.5.6.7) /AU 

(2.4.6.7) /(2,3,4,5) 
(l,3,4,6)/(2,3,4,5) 

TOPE 

(lr3,4,6) 
(2,3 ,4,5) 

(3.5.6.7) 

(2.4.6.7) 

Same 

as 

above 

62.7 
82.2 
71.2 

79.7 

118 

(l,3,4,6)/(2,4,6,7) & (2,3,4,5) 

CCUT 

(1.3.4.6) 
(2,3,4,5) 

(3.5.6.7) 

(2.4.6.7) 

Same 

as 

above 

41.9 

37.8 

41.4 

40.3 

370 

None 

PAST 

(1.3 .4.6) 
(2,3,4,5) 

(3.5.6.7) 

(2.4.6.7) 

Same 

as 

above 

51.4 

51.1 

67.1 

69.1 

350 

(2.3.4.5) /(3,5,6,7) & (2, 4,6,7) 

(1.3.4.6) /(3,5,6,7) & (2,4,6,7) 

CROP 

(1.3.4.6) 
(2,3 ,4,5) 

(3.5.6. 7) 

(2.4.6.7) . 

Same 

as 

above 

99.2 

99.2 

98.1 

98.1 

369 

None 


dr3,4,6) 

QamA 

91.3 


(2,4,6,7)/(2,3,4,5) 

530TT. 

(2, 3,4,5) 


95.0 


(3,5,6,7)/(2,3,4,5) 


(3,5,6,7) 

CIO 


XUUQ 

d,3,4,6)/(2,3,4,5) 


(2,4,6,7) 

CiLAJVtj 




WATER 

(1.3.4.6) 
(2,3,4,5) 

(3.5.6.7) 

(2.4. 6.7) 

Same 

as 

above 

87.3 

86.3 

87.3 

85.3 

300 

None 


^Channel combinations that are significantly different are indicated based upon a 
Nwman-Keuls con^arison with a = 0.10. 





187 


APPENDIX B (Tables 37-77) 


CoTOEarisons Among Classification Algorithms (L2, GML and SBCHO) 
for both the 1979 and 1980 TMS Data Set 


Table 38. Classification Results Based Upon Supervised Training Statistics and the L-2 
Distance Classifier, Using Channels 2, 4, 3, & 7 of the 1979 IMS Data. 
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Table 39. Classification Results Based Upon Supervised Training Statistics and the GML Classifier 
Using Channels 2 , 4r 5, & 7 (the best 4) of the 1979 TMS Data, 
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Table 40. Classification Results Based Upon Supervised Training Statistics and the ECHO 
Classifier, Using Channels 2, 4, 5, & 7 of the 1979 IMS Data. 








Table 41. Classification Results Based Upon Supervised Training Stat i stics and the I/-2 Mini 
Distance Glassifier, Using All 7 Channels of the 1979 IMS Data. 
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Table 42. Classification Results Based Upon Supervised Training Statistics cind the GML Classifier 
Using All 7 Channels of the 1979 IMS Data. 
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Table 43. Classification Results Based Upon Supervised Training Statistics and the BCBO 
Classifier, Using All 7 Channels of the 1979 TMS Data. 
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3!lable 44. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the L-*2 
Mininum Distance Classifier, Using Channels 1, 3, 4, &6 of the 1979 THS Data. 
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Table 45. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the GML 
Classifier, Using Channels 1, 3, 4, & 6 (the best 4} of the 1979 QMS Data. 
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Table 48. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the GML 
Classifier, Using All 7 Channels of the 1979 WS Data. 
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Table 49. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the ECHO 
Classifier, Using All 7 Channels of the 1979 IMS Data. 
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AVE^E PERTOFMANCE BY COVER CLASS = 651.8/8 = 81.5% 
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Tsdsle 50. Statistical con^rison among overall classification results for 
all three algorithms (12, CML, SBCHO) using the "best 4" channel 
subset (2, 4, 5, 7) and based upon the 1979 supervised training 
statistics and sample block test data. 



Table 

Algorithm and Location 

% Correct 

No. of 
Samples 

Significant^ 

Differences^ 


L2 

(Table 38) 

81.8 



Overall 






Classification 

GML 

(Table 39) 

88.1 

10,557 

All 

Performance 







SE3CHO 

(Table 40) 

90.0 




^Classification algorithms which are significantly different are indicated 
based upon a Newman-Keuls oon^arison with a = 0.10. 
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Table 51. Statistical con^nison aitiong classification results by 
cover class for all three algorithins (L2, OIL, SE3CH0) 
using the "best 4" channel subset (2,4 rSr?) 2 ind based 
upon the 1979 supervised training statistics and sample 
block test data. 


Cover 


Table 


No. of 

Significant^ 

Differences^ 

Class 

Algorithm and Location % Correct 

Samples 


L2 

(Table 38) 

85.5 


L2/GML 

PINE 

GML 

(Table 39) 

91.0 

775 

I2/SBCH0 


SBCHO 

(Table 40) 

92.9 



HDWD 

12 

GML 

SBCHO 

Same 

as above 

84.0 

91.1 
93.7 

7269 

All 

TUPE 

L2 

GML 

SBCHO 

Same 

as above 

55.1 

58.5 

57.6 

118 

None 


L2 

Same 

as above 

68.6 


I2/GML 

OCOT 

GML 

SBCHO 

60.5 

58.5 

370 

L2/SBCHO 


L2 

Same 

as above 

70.9 


L2/GNL 

PAST 

GML 

SBCHO 

82.6 

83.1 

350 

L2/SBCHO 


L2 

Same 

as above 

88.1 


I2/GML 

CROP 

GML 

SBCHO 

79.7 

81.6 

369 

L2/SBCHO 


12 

Same 

as above 

71.6 


I^/GML 

SOIL 

GML 

SBCHO 

85.6 

86.0 

1006 

L2/SBCHO 


12 

Same 

as above 

85.7 


L2/0«L 

WATER 

GML 

SBCHO 

78.7 

79.7 

300 

L2/SBCHO 

^Classification algorithms which 

are significantly different are 


indicated based upon a Newman-Keuls coirparison with a = 0.10. 
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Table 52. Statistical car^rison among overall classification results for 
all three algorithms (L2, GML, SECHO) using all 7 channels and 
based upon the 1979 supervised training statistics and sample 
block test data. 




Table 


No. of 

Significant^ 

Differences^ 


Algorithm and Location 

% Correct 

Samples 


12 

(Table 41} 

85.3 



Overall 

Classification 

GML 

(Table 42) 

90.7 

10,557 

All 

Performance 

SECHO 

(Table 43) 

91.6 




^Classification algorithms which are significantly different are indicated 
based upon a Newman-Keuls comparison with a = 0.10. 
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Table 53. Statir!:ical con^rison amcxig classification results by 
cover class for all three algorithms (L2, OIL, SBOX)) 
usii^ all 7 channels and based iqpon the 1979 supervised 
training statistics cind sample block test data. 


Cover 


Table 


No. of 

Significantly 

Differeices**^ 

Class 

Algorithm and Location 

% Correct 

Samf^es 


12 

(Table 41) 

91.5 


12/GML 

PINE 

OIL 

(Table 42) 

95.0 

775 

L2/SBCHO 


SBCHO 

(Table 43) 

94.7 


HEND 

12 

GML 

SECHO 

Same 

as above 

88.2 
93.2 
94. B 

7269 

All 

TUPE 

12 

GML 

SBCHO 

Same 

as above 

68.6 

67.8 

65.3 

118 

None 

COJT 

12 

GML 

SBCHO 

Same 

as above 

65.4 

64.9 

64.6 

370 

None 


12 

Same 
as above 

7U.3 


12/01L 

PAST 

GML 

SBCHO 

83.4 

84.6 

350 

I2/SBCH0 


12 

Same 

as above 

87.8 


L2/(3fL 

CROP 

GML 

SBCHO 

81.0 

81.0 

369 

L2/SBCHO 


12 

Same 

as above 

73.2 


12/GML 

SOIL 

GML 

SBCHO 

90.6 

90.6 

1006 

L2/SBCHO 


12 

Same 
as above 

87.3 


12/GML 

WATER 

GML 

SBCPO 

90.7 

91.6 

300 

L2/SBCH0 


^Classification algorithms which are significantly differait are 
indicated based upon a Nesmian-Keuls con^rison with a = 0.10. 
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Table 54, Statistical con^rison among overall classification results for 
all three algorithms (L2, GML, SECHO) using the "best 4" channel 
subset (1,3 ,4 ,6) and based upon the 1979 MCB training statistics 
and s^ple block test data. 




Table 


NO. of 

Significant,/ 

Differences^ 


Algorithm and Location 

% Correct 

Samples 


12 

(Table 44) 

77.4 



Overall 

Classification 

Performance 

GML 

86,1 

10,557 

All 

(Table 45) 


SEICHO 

(Table 46) 

90.6 




^Classification algorithms which are significantly different are indicated 
based upon a Newnan-Keuls cort^arison with a = 0.10. 
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Table 55. Statistical canpacison among classification results ty 
cover wlass for all three algorithms (L2r GMLr SBCHO) 
using the "best 4" channel subset (If3r4,6) and baaed 
upon the 1979 MCB training statistics and sample block 
test data. 


Cover 

Class 

Table 

Algorithm and Location 

% Correct 

No. of 
Sanples 

Significantly 

Differences^ 

PINE 

12 

GML 

SBCHO 

(Table 44) 
(Table 45) 
(Table 46) 

85.3 

91.9 

94.8 

775 

All 

HDND 

L2 

GML 

SBCHO 

Same 

as above 

76.8 

88.4 

94.7 

7269 

All 

TOPE 

12 

GML 

SBCHO 

Sane 

as above 

47.5 

62.7 

40.7 

118 

12/(XL 

O^SBCHO 

CCUT 

12 

GML 

SBCHO 

Sane 

as above 

31.9 

41.9 
39.5 

370 

I2/GML 

L2/SBCH0 

PAST 

L2 

GML 

SBCHO 

Sane 

as above 

50.6 

51.4 

47.4 

350 

None 

CROP 

12 

GML 

SBCHO 

Same 

as above 

97.0 

99.2 

98.6 

369 

I2/GML 

SOIL 

12 

GML 

SBCHO 

Same 
as above 

93.9 

91.3 

94.7 

1006 

L2/GML 

GI^SBCHO 

WATER 

12 

GML 

SBCHO 

Same 
as above 

88.7 

87.3 

89.0 

300 

None 


^Classification algorithms which eire significantly different are 
indicated based upon a Newman-Keuls oontarison with ot = 0.10. 
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Tsdale 56. Statistical con^arison eBiK»ig ov eral l, classification results for 
all three algorithms (12, GNL, SBCHO) using all 7 channels and 
based upon the 1979 MCB training statistics and sample block 
test data. 




Table 


No. of 

Significantly 

Differences^ 


Algorithm and Lcxiation 

% Correct 

Samples 


12 

(Table 47) 

81.4 



Overall 

Classification 

GML 

(Table 48) 

88.7 

10,557 

All 

Performance 

SBCHO 

(Tcible 49) 

92.3 




^Cleissification algorithms which are significantly differoit are indicated 
based upon a Newman~Keuls oon^rison with a » 0.10. 
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Table 57. Statistical oxnparison among classification results by 
cover class for all three algorithms (L2, GML> SBQK)) 
usi]^ all 7 channels and based upon the 1979 NC8 
training statistics and sample block test data. 


Cover 

Class 

Table 

Algorithm and Location 

% Correct 

No. of 
Samples 

Significant^ 

Differences^ 

ph^ 

£2 

GML 

SBCHO 

(Table 47) 
(Table 48) 
(Table 49) 

89.3 

93.3 
94.6 

775 

£2/caiL 

L2/SBCHO 

HDWD 

£2 

GML 

SBCHO 

Same 

as above 

82.1 

91.1 

96.1 

7269 

All 

TUPE 

12 

GML 

SECHO 

Same 

as above 

58.5 

83.9 

79.7 

118 

L2/GML 

I2/SECHO 

CCUT 

12 

GML 

SECHO 

Same 

as above 

35.1 

45.7 

45.4 

370 

L2/OIL 

L2/SBCH0 

PAST 

12 

OIL 

SECHO 

Same 
' ^ above 

66.0 

61.4 

56.9 

350 

L2/SBCH0 

CROP 

12 

GML 

SECHO 

Same 
as above 

98.6 

98.6 

97.6 

369 

None 

SOIL 

£2 

GML 

SECHO 

Same 

as above 

87.1 

90.8 

92.5 

1006 

£2/01L 

L2/SECH0 

WATER 

L§ 

GML 

SECHO 

Same 
as above 

87.3 

86.7 

89.0 

300 

None 


^Classification algorithms which are significantly different are 
indicated based upon a Newman-Keuls con^rison with a = 0.10. 
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Ofeble 59. Classification Results Based Upon Su^n^i^^Training Statistics and the GML aassifier, 
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Table 60, Classification Results Based Upon Si^rvised Training Statistics and the SBCBO 
Classifier, Using Channels 1, 2, 3, & 6 of the 1980 IMS Data, 


• f ■ 
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Table 61. Classification Results Based Upon Supervised Training Statistics and the L-2 
Distance Classifier, Using All 8 Channels of the 1980 TMS Data. 
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Table 62. Classification Results Based Upon Supervised Training Statistics and the GML Classifier 
Using All 8 Channels of the 1980 THS Data. 


I 







ORIGINAL PAGE IS 
OF POOR QUALITY 


214 



T^le 64. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the L-2 
Minimum Distance Classifier, Using Channels 1, 3, 4, & 5 of the 1980 IMS Data. 
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Table 65. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the GML 
Classifier, Using Channels 1, 3, 4, & 5 of the 1980 TOS Data. 



ORIGINAL PAGE IS 
OF POOR QUALITY 





Table 66 « Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the SBCBO 
Classifier, Using Channels 1, 3, 4, & 5 of the 1980 TMS Data. 
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•[teble 67. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the L-2 
Minimum Distance Classifier, Using All 8 Channels of the 1980 IMS Data, 



^IGINAL PAGE !S 
OF POOR QUALITY 




Table 68. Classificaticxi Results Based Upon Multi-Cluster Blocks Training Statistics and the OIL 
Classifier, Using All 8 Channels of the 1980 TMS Data. 
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Table 69. Classification Results Based Upon Multi-Cluster Blocks Training Statistics and the SEXZHO 
Classifier, Using All 8 Channels of the 1980 IMS Data, 




■1 
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Table. 70, Statistical ccnt^arison among overall classification results for 
all 3 algorithms (L2, GMLr SEICHO) using the "best 4" channel 
subset (1,2, 3, 6) and based upon the 1980 supervised training 
statistics and sample block test data. 




Table 


No. of 

Significant, / 
Differences'^ 


Algorithm and Location 

% Correct 

Sam{d.es 


12 

(Table 58) 

75.3 



Overall 

Classification 

QOi 

(Table 59) 

82.8 

9667 

All 

Performance 

SE)CHO 

(Table 60) 

85.9 




1 / 

'^Classification algorithms which cure significantly different are indicated 
based upon a Newroan-Keuls oon^arison with a =0.10. 
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Table 71. Statistical con^rison amcHig classification results by 
cover class for all three algorithms (L2, GKL, SBCHO) 
using the "best 4" channel subset (1,2 f3 ,6) and based 
upon the 198C supervised training statistics and sample 
block test data. 


Cover 

Class 

Table 

Algorithm and Location 

% Correct 

No. of 
Samples 

Significant,/ 

Differences^ 

PINE 

L2 

GML 

SBCHO 

(Table 58) 
(Table 59) 
(Table 60) 

66.2 

72.5 

71.5 

393 

None 

mjD 

12 

GML 

SECHO 

Same 

as above 

90.8 
. 92.8 

6584 

All 

TOPE 

L2 

GML 

SBCHO 

Same 

as above 

22.6 

19.3 

19.3 

145 

Nc»ie 

RGHD 

L2 

GML 

SBCHO 

Same 
as above 

40.2 

55.0 

72.9 

458 

All 

PAST 

L2 

(Mi 

SBCHO 

Same 

as above 

63.2 

48.5 

40.4 

. 408 

All 

CPDP 

L2 

GML 

SBCHO 

Same 

as above 

A6,l 

73.6 

88.0 

890 

All 

SOIL 

L2 

GML 

SBCHO 

Same 

as above 

73.3 

78.6 

78.6 

439 

L2/GML 

L2/SBCHO 

WATER. 

12 

GML 

SBCHO 

Same 
as above 

90.3 

74.6 

74.6 

350 

I2/GML 

L2/SBCHO 


^Classification algorithms which are significantly different are 
indicated based upon a Newman-Keuls con^rison with a = 0.10. 
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TSable 72. Statistical con^arison among overall classification results for 
all 3 algorithms (12, (XL, SE3CH0) using all 8 channels ansi based 
upon the 1980 supervised training statistics and sample block 
test data. 


Table No. of Significant^ 

Algorithm and Location % Correct Samples Differences*^ 



L2 

(Table 61} 

77.5 

Overall 

Classification 

GML 

(Table 62) 

88.5 

Performance 


SEX3K) 

(Table 63) 

89.6 


9667 


All 


^Classification edgorithms which are significantly different are indicated 
based upon a Newman-Keuls comparison with a - 0.10. 
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T!c±)le 73, Statistical coiiparison among classification results by 
cover class for all three cdgorithms (L2, GML, SEICHO) 
using all 8 channels and based upon the 1980 supervised 
training statistics and sample block test data. 


Cover 


Table 


No. of 

Significant. / 
Differences'^ 

Class 

Algorithm and Location 

% Correct 

Samples 


12 

(Table 61) 

68.4 


I2/GML 

PINE 

GML 

(Table 62) 

75.6 

393 

I2/SBCHO 


SECHO 

(Table S3) 

75.1 



HDND 

12 

GKL 

SECHO 

Same 

as above 

81.1 

92.8 

93.9 

6584 . 

Ml 

TUPE 

12 

GML 

SECHO 

Same 

as above 

61.4 

19.3 

19.3 

145 

I2/GML 

I2/SECH0 

R3HD 

12 

GML 

SECHO 

Same 

as above 

17.7 

81.4 

89.5 

458 

All 


12 

Same 

as above 

81.4 


I2/GML 

PAST 

GML 

SECHO 

50.0 

50.2 

408 

L2/SECHO 


12 

Same 

as above 

80.8 


L2/GML 

CROP 

GML 

SECHO 

98.3 

98.9 

890 

L2/SECHO 


L2 

Same 

as above 

76.8 


L2/CML 

SOIL 

GML 

SECHO 

92.7 

92.7 

439 

L2/SBCHO 


12 

Same 

as above 

92.3 


I2/GML 

WATER 

(Mi 

SECHO 

73.7 

73.7 

350 

L2/SECH0 


^Classification algorithms which are significantly different are 
indicated based upon a Newman-Keuls comparison with a = 0.10. 
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Table 74. Statistical con^rison among overall classification results for 
all 3 algorithms (L2, GML, SEX310) using the "best 4" channel 
subset (1,3 r4, 5) and based upon the 1980 MCB training statistics 
and sample block test data. 



Table 

Algorithm and Location 

% Correct 

No. of 
Samples 

Significant, y 
Differences^ 

Overall 

L2 

(Table 64) 

67.6 


. 

Classification 

Performance 

GNL 

(Table 65) 

79.7 

9667 

All 


SBCHO 

(Table 66) 

84.6 

- 



^Classification algorithms which are significantly differeit are indicated 
based upon a Newman-Keuls con^arison with a = 0.10. 


Table 75 


OF 

. Statistical con^rison am<»ig classification results, by 
cover class for cill three algorithms (12, CKL, SEXIiO) 
using the "best 4" channel subset (1,3, 4^5) ba;3ed 
upon the 1980 MCB training statistics and sample block 
test data. 


page 'S 
qUAUT^ 


Cover 

Class 

Table 

Algorithm and Location 

% Correct 

No. of 
Samples 

Significant,/ 

Differences^ 

PINE 

12 

GML 

SBCHO 

(Table 64) 
(Table 65) 
(Table 66) 

79.4 

82.2 

80.9 

393 

None 

HDND 

12 

GML 

SBCHO 

Same 

as above 

69.3 

83.1 

90.9 

6584 

All 

TOPE 

12 

GML 

SECHO 

Same 
as above 

10.3 

17,9 

18,6 

145 

L2/OCi 

L2/SECHO 

PGHD 

12 

GML 

SECHO 

Same 
as above 

43.7 

68.1 

69.1 

458 

L2/GML 

L2/SBCHO 

PAST 

12 

GML 

SBCHO 

Same 

as above 

69.6 

78.7 
78.9 

408 

I2/GML 

L2/SBCHO 

CROP 

12 

GML 

SECHO 

Same 

as above 

57.8 

60.3 

59.4 

890 

None 

SOIL 

12 

GML 

SBCHO 

Same 
as above 

72.7 
86.3 

79.7 

439 

All 

WATER 

12 

GML 

SBCHO 

Same 
as above 

94.3 

94.9 

94.9 

350 

None 


^Classification algorithms which are significantly different are 
indicated based upon a Newman-Keuls con^arison with a = 0.10, 
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Table 76. Statistical carparison among overall classification results for 
all 3 algorithms (L2r dfL, SBCHO) using all 8 channels and based 
upon the 1980 MCB training statistics and sample block test ^ta. 




Table 


No. of 

Significant^ 

Differences^ 


Algorithm and Location 

% Correct 

Sanples 


12 

(Table 67) 

70.2 



Overall 

Classification 

GKL 

(Tcdale 68} 

79.8 

9667 

All 

Iterformance 

SEJCHO 

(Table 69) 

84.2 




^Classifivcatic»i algorithms which are significantly different are indicated 
based upon a Newman-Keuls comparison with a = 0.10. 
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Table 77. Statistical ccxiparison among classification results by 
cover class for all three algorithms (12 r GML, SEX310) 
usii^ all 8 channels and based \^n the 1980 MC8 
training statistics and sample block test data. 


Cover 


Table 


No. of 

Significant, / 
Differences^ 

Class 

Algorithm and Location 

% Correct 

Samples 


12 

(Table 67) 

76.6 

393 

I2/GML 

PINE 

GML 

(Table 68) 

82.7 


I2/SBCH0 


SBCHO 

(Table 69) 

83.7 



HDND 

12 

GML 

SEOiO 

Same 

as above 

69.7 

83.4 

90.2 

6584 

All 


12 

Same 

as above 

35.2 


I2/GML 

TUPE 

GML 

SBCHO 

20.0 

19.3 

145 

L2/SBCHO 


12 

Same 

as above 

58.3 


I2/GML 

RGHD 

GML 

SBCHO 

71.4 

70.7 

458 

L2/SBCHO 

PAST 

12 

GML 

SBCHO 

Same 
as above 

72.1 

76.0 

74.8 

408 

Ncne 


L2 

Same 

as above 

72.6 


L2/GML 

CROP 

GML 

SECHO 

55.6 

53.8 

890 

I2/SBCH0 


12 

Same 
as above 

70.6 


I2/GML 

SOIL 

GML 

SECHO 

92.9 

93.4 

439 

I2/SECH0 

WATER 

12 

GML 

SBCHO 

Same 

as above 

94,0 

94.3 

94.3 

350 

None 


‘^Classification algorithms which are significantly different are 
indicated based upon a Nwinan-Keuls comparison with a = 0,10. 
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Coirparisons Between the 1979 Original TMS Data Set and the 
1979 K-L Transformed IMS Data Set 
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Table 78. Sunnary table of overall classification performances conpiring ^e 
untransformed OMS and the K-L transformed data sets for all three 
classifiers. 


Classifier 

iniVM 1 if. I 

Untransformed TMS^ 
(Channels 1.3.6^ 

’irHWyiWAI 

Table 

.Location 

a VI. 

K-L Transfomted Data 
(Compcments 1,2,3) 

Table 

Location 

12 

65.2^ 

(Table 80) 

80.0%^ 

(Table 83) 

GML 

78.4^ 

(Table 81) 

82.9^ 

(Table 84) 

SECHO 

86,8® 

(Table 82) 

86.6® 

(Table 85) 


Data Subset; "Best 4" Channels or 1st 4 ComDonents 


Classifier 

Untransformed 
(Channels 2. 4,5.7) 

Table 

Locaticxi 

K-L Transformed Data 
(Conconents 1,2,3, 4) 

Table 

Location 

12 

81.8® 

(Table 86) 

83.8^ 

(Table 89) 

OIL 

88. 1*^ 

(Table 87) 

84.6® 

(Table 90) 

SECHO 

90.0^ 

(Table 88) 

87.0^ 

(Table 91) 


^Significantly differait overall classificaticxi performances between the 
untransformed and the K-L transformed data sets for each classifier is 
indicated by a different superscript (bEused upon a Newman-Keuls cei^rison 
with a = 0.10) . 
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Table 79, Sunmary table of overall class performances for three algorithms (L2, GML, 
SECHO) based upon four data sets. 


Data Set Description 

3 Channels (1,3,6), Untransformed 
1st 3 Conponents, K-L Transformed 

4 Ciiannels (2,4,5 ,7) , Untransformed 
1st 4 Components, K-L Transformed 


Overall Classification Performance (%) 
bv Classifier (and Table 




65.2^ (Table 80) 
80.0^ (Table 83) 
81.8^ (Table 86) 
83.8^ (Table 89) 


ai L 

78.4^ (Table 81) 
82.9^ (Table 84) 
88.1^ (Table 87) 
84.6^ (Table 90) 


SBOJQ 

86.8^ (Table 82) 
86.6^ (Table 85) 
90 .0^^ (Table 88) 
87.0*^ (Table 91) 


'^Different superscripts between columns of the same row indicate significantly different 
overall classification performances between classifiers (based upon a Newman-Keuls 
comparison with a =0.10). 


Table 80. Classification Results Based Upon Si^jervised Training Statistics and the L2 Classifier 
Using the Best Three Channels (1, 3, 6) of the 1979 DiS Data. 
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Table 81. Waveband Evaluation Classification Results Using Channels If 3f & 6 (the best 3). (1979 

IMS Data, Supervised Training Statistics, OHL Classifier) 
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Table 82, Classification Results Based Upon Su^rvised Training Statistics and the SBCHO 
Classifier, Using the Best Three Charinels (1, 3, 6) of tiie 1979 TMS Data. 


ORIGINAL « 
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Table 84. Classification Results Based Upon Si:$)ervised Training Statistics and the GML Classifier 
Using the 1st 3 Contonents of the 1979 Principal Component TMS Data. 
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!Bable 86, Classification Results Using Channels 2f 4, 5r & 7 (the best 4), (1979 IMS Data 

Supervised Training Statistics# L2 Classifier) 


ORIGINAL PAG£ IS 
Of POOR QUALITY 


238 




Table 87. Classification Results Using Channels 2, 4, 5, & 7 (the best 4). (1979 Data 

Supervised Training Statistics, 31L Classifier) 
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Table 88. Classification Results Using Channels 2, 4,5, & 7 (the best 4). (197S IMS Data 

Si^rvised Training Statistics, SE)CHO Classifier) 
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Table 89, Classification Results Based Upon Supervised Training Statistics and the L2 Classifier 
Using the 1st 4 Conponents of the 1979 Principal Corponent TMS Data. 
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AVERflGE PERFORMANCE BY COVER CLASS = 621/8 = 77.6% 



ISJale 90, Classification Results Based Upon Si^cvised Training Statistics and th*i GML Classifier 
Using the 1st 4 Canponents of the 1979 Principal Component IMS Data. 
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Table 91, Classification Results Based Upon Supervised Training Statistics and the SE3CH0 
Classifier, Using the 1st 4 Components of the 1979 Principal Component IMS Data 
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Table 92. Statistical comparison among overall classification results for 
all 3 algorithms (L2, GNL, SBCHO) using the first 3 components of 
the 1979 K-L transformed TMS data and based upon the 1979 
supervised statistics and sample block test data. 




Table 


No. of 

Significant, , 
Differences'^ 


Algorithm 

Location 

% Correct 

Samples 

' 

L2 

(Table 83) 

80.0 



Overall 

Classification 

GML 

(Table 84) 

82.9 

10,557 

All 

Performance 

SECHO 

(Table 85) 

86.6 




‘^Classificaticxi algorithms which are significantly different are indicated 
based upon a Newman-Keuls cxmparison with a = 0.10. 
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Table 93. Statistical coi^rison among classification results lay 
cover class for all three algorithms (L2, OIL, SECHO) 
using the first 3 caipjnents of the 1979 K-L transformed 
OMS data and based upon the 1979 supervised training 
statistics and sample block test data. 


Cover 

Class 

Algorithm 

Table 

Location % Correct 

No. of 
Sandies 

Significantly 

Differences^ 

PINE 

L2 

GML 

SECHO 

(Table 83) 
(Table 84) 
(Table 85) 

89.0 

90.1 

91.2 

775 

None 

HDHD 

L2 

GML 

SECHO 

Same as 
above 

80.9 

85.9 
91.3 

7269 

All 

TOPE 

L2 

GML 

SECHO 

Same as 
above 

50.8 

45.8 
52.5 

118 

None 

OCUT 

L2 

GML 

SECHO 

Same as 
above 

61.1 

47.8 

50.8 

370 

00/12 

SECH0/I2 

PAST 

L2 

.GML 

SECHO 

Same as 
above 

69.4 

80.0 

84.9 

350 

All 

CROP 

L2 

GML 

. SECHO 

Same as 
above 

89.7 

87.0 

87.3 

369 

None 

SOIL 

L2 

SECHO 

Same as 
above 

75.2 

74.3 
70.6 

1006 

SBCHO/OIL 

SECH0/L2 

WATER 

L2 

GML 

SECHO 

Same as 
above 

87.0 
76.3 

73.0 

300 

SECH0/L2 

GML/12 

^Classificaticai algorithms which 

are significantly differ ait are 


indicated based upon a Newman-Keuls conqparison with a = 0.10. 



Table 94. Statistical compacison between overall classification results for the 
12 classifier for two dimensioiality reduction techniques (Feature 
Selection, K-L transformation) for the best 3 channel feature set 
based upon 1979 supervised training statistics and sample block test 
data. 



Reduction, / 
Techniques 

Table 


No. of 

Significant2/ 

Difference?^ 


Location 

% Correct 

Samples 

Overall 

Classification 

Feature Selection 
(Untransformed) 

(Table 80) 

65.2 

10,557 

Yes 

Performance 

K-L Transformed 

(Table 83) 

80.0 




^Feature selection optimum subset includes channels 1, 3, & 6 of the original 1979 
^IMS data set. 


K-L transformaticMi includes the first 3 ccm^onents of the K-L transformed 1979 
data set. 

^Classification performance differeice is based upon a Newroan-Keuls oon^arison 
with a = 0.10, 
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Table 96. Statistical confarison between overall classification results for the 
GML classifier for two dimensionality reducticMi techniques (Feature 
Selection, K-L transformation) for the best 3 channel feature set 
based u^n 1979 supervised training statistics and sample block test 
data. 



Reduction, y 
Technique^ 

Table 


No, of 

Significant2/ 

Difference?^ 


Location 

% Correct 

Samples 

Overall 

Classification 

Feature Selecticwi 
(Untransformed) 

(Table 81) 

78.4 

10,557 

Yes 

Perfoi7«iance 

K-L Transformed 

(Table 84) 

82.9 




^Feature selection optimum subset includes channels 1, 3, & 6 of the original 1979 
7NS data set. 


K-L transformation includes the first 3 components of the K-L trcmsformed 1979 
data set. 

2 / 

^Classification performance difference is based i:pon a NeMman-Keuls comparison 
with a = 0.10. 
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l^le 97. Statistical comparison between classification resists for the 
GML classifier by cover class for two dimensionality redaction 
techniques (Feature Selection, K-L transformation) for the 
best three channel feature set based i;qpon 1979 siqpervised 
training statistics and sample block test data. 


Cover 

Reduction^ , 
Techniques 

Table 


No. of 

Significant2/ 

Difference?^ 

Class 

Location 

% Correct 

Sanqples 


Feature Selection 

(Table 81) 

94.7 



PINE 

(imtransforrned) 



775 

Yes 


K-L Transformed 

(Table 84) 

90.1 




Feature Selection 


77.8 



HEMD 

(Untransformed) 

Same as 
above 


7269 

Yes 


K-L Transformed 


85.9 




Feature Selection 


21.2 



TUPE 

(Untransformed) 

Same as 
above 


118 

Yes 


K-L Transformed 


45.8 




Feature Selection 


68.1 



OCUT 

(Untransformed) 

Sane as 
above 


370 

Yes 


K-L Transformed 


47.8 




Feature Selecticxi 


62.3 



PAST 

(Untransformed) 

Same as 
above 


350 

Yes 


K-L Transformed 


80.0 




Feature Selecticai 


61.5 



CROP 

(Untransformed) 

Same as 
above 


369 

Yes 


K-L Transformed 


87.0 




Feature Selection 


89.8 



SOIL 

(Untransformed) 

Same as 
above 


1006 

Yes 


K-L Transformed 


74,3 




Feature Selectiai 


88.0 



WATER 

(Untransformed) 

Same as 
above 


300 

Yes 


K-L Transformed 


76.3 




^Feature selection c^imum subset includes channels 1, 3, & 6 of the 
original 1979 IMS data set. 


K-L transformation includes the first 3 components of the K-L 
transformed 1979 data set. 

2 / 

^Classification performance differences are based t;pon a Newraan-Keuls 
comparison with a - 0.10. 
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Table 98. Statistical conopari&cn between overall classificaticMi results for the 
SBCHO classifier for two dimensionality reducticxi techniques (Feature 
Selection, K-L transformation) for the best 3 channel feature set 
based upon 1979 supervised training statistics and sample block test 
data. 



Reduction. / 
Technique^ 

Table 

Location 

% Correct 

No. of 
Samples 

Significant-/ 

Difference?^ 

Overall 

Classification 

Performance 

Feature Selection 
(Untransformed) 

K-L Transformed 

(Table 82) 
(Table 85) 

86.8 

86.6 

10,557 

>vji ■* 


^Feature selection cptimum subset includes channels 1, 3, & 6 of the original 1979 
OMS data set. 

K-L transformatiai includes the first 3 conponents of the K-L transformed 1979 
data set. 

2 / 

Classification performance difference is based upon a Newman-Keuls conparison 
with a = 0.10. 
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Table 99. Statistical axrparison between classificaticai results for the 
SEOIO classifier by cover class for two dimensionality 
reduction techniques (Feature SelectioOf K-L transformation) 
for the best three channel feature set based upon the 1979 
si^)ervised training statistics and san^le block test data. 


Cover 

Reduction, / 
Techniques 

Table 


No. of 

Significant2/ 

Difference?^ 

Class 

Location 

% Correct 

Samples 


Feature Selecticxi 

(Table 82) 

96.5 



PINE 

(Untransformed) 



775 

No 


K-L Transformed 

(Table 85) 

91.2 




Feature Selection 


89.1 



HDWD 

(Untransformed) 

Same as 
above 


7269 

Yes 


K-L Transformed 


91.3 




Feature Selection 


22.0 



TUPE 

(Untransformed) 

Same as 
above 


118 

Yes 


K-L Transformed 


52.5 




Feature Selection 


74.6 



OCUT 

(Untransformed) 

Same as 
above 


370 

Yes 


K-L Transformed 


50.8 




Feature Selection 


68.3 



PAST 

(Untransformed) 

Same as 
above 


350 

Yes 


K-L Transformed 


84.9 




Feature Selection 


62.9 



CROP 

(Untransformed) 

Same as 
above 


369 

Yes 


K-L Transformed 


87.3 




Feature Selection 


92.0 



SOIL 

(Untransformed) 

Same as 
above 


1006 

Yes 


K-L Transformed 


70.6 




Feature Selection 


81.3 



WATER 

(Untransformed) 

Same as 
above 


300 

Yes 


K-L Transformed 


73.0 




^Feature selection c^>tiinum subset includ'es channels 1, 3, & 6 of the 
original 1979 IMS data set. 


K-L transformation includes the first 3 coip^nents of the K-L 
transformed 1979 data set. 

^Classification performance differences are based tpon a Newman-Keuls 
coiT^rison with a = 0.10. 
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Table 100. Statistical coirparison among overall classification results for 
all 3 algorithms (L2, (HL, SECHO) using the first 4 corrponents 
of the 1979 K-L transformed OMS data and based upon the 1979 
supervised statistics and sample block test data. 




Table 


No. of 

Significant, , 
Differ wces'^ 


Algorithm 

Location 

% Correct 

Samples 


L2 

(Table 89) 

83.8 


L2/SECH0 

Overall 

Classification 

GML 

(Table 90) 

84.6 

10,557 

(liVSEaiO 

Performance 

SECHO 

(Table 91) 

87.0 




^Classification algorithms which are significantly differoit are indicated 
based i^n a Newman-Keuls comparison with a - 0.10. 


* 
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Table 101. Statistical con^Jarison among classification results by 
cover class for all three algorithms (L2r O^r SECHO) 
using the 1st 4 coir^nents of the 1979 K-L transformed 
TNS data and based upon the 1979 supervised training 
statistics and sample block test data. 


Cover 


Table 


No. of 

Significant,/ 

Differences^ 

Class 

AJ.gorithm 

Location 

% Correct 

Sanqples 


L2 

(Table 89) 

89.2 


L2/Otti 

PINE 

GML 

(Table 90) 

92.0 

775 

L2/SECH0 


SECHO 

(Table 91) 

92.9 



HDKD 

L2 

GML 

SECHO 

Same as 
above 

86.1 

88.7 

92.4 

7269 

All 

TUPE 

L2 

GML 

Same as 
above 

63.6 

36.4 

118 

SECH0/L2 

GML/L2 


SECHO 

28.8 



CCUT 

L2 

GML 

SECHO 

Same as 
above 

61.6 

55.9 

56.2 

370 

None 


L2 

Same as 
above 

68.6 


L2/SECH0 

PAST 

GML 

86.3 

350 

L2/a?L 


SECHO 

85.7 




L2 

Same as 
above 

89.4 


SECH0/L2 

CROP 

GML 

SECHO 

73.2 

71.8 

369 

GMD/L2 


L2 

Same as 
above 

75.5 


SECB0/L2 

SOIL 

GML 

69.9 

1006 

(Mj/U 


SECHO 

69.7 




L2 

Same as 
above 

87.0 


SECH0/L2 

WATER 

OIL 

SECHO 

81.0 

81.0 

300 

GMI/I2 


•^Classification algorithms which are significantly differoit are 
indicated based upon a Newman-Keuls crai^arison with a = 0.10. 
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Table 102. Statistical conparison betweai overall classification results for the L2. 
classifier for two dimensionality reduction techniques (Feature 
Selection, K-L transformation) for the best 4 channel feature set based 
upon 1979 supervised training statistics and sample block test data. 


Reductiai, / 
Technique^ 


Table 

Location 


% Correct 


No. of 
Samples 


Significant2/ 

Difference?^ 


Oireraii Feature Selection (Table 86) 81.8 

Classification <l‘'*'transfonned) 

Performance Transformed (Table 89) 83.8 


10,557 


Yes 




Feature selection optimum subset includes channels 2, 4, 5, & 7 of the original 
1979 TMS data set. 


K-L transformation includes the first 4 conponents of the K-L transformed 1979 
data set. 

2 / . . 

^Classification performance difference is beised upon a Newman-Keuls oonparison 
with a s 0.10. 


Table 103. Statistical con^nson between classificaticai results for the 
L2 classifier by cover class for two dijnenslcxiality recluction 
techniques (Feature Selection, K-L transformation) for the 
best four channel feature set based i^n 1979 supervised 
training statistics and sample block test data. 


Cover 

Class 


ReducticMiw 

Technique^ 


Table No. of Significant 

Location % Correct Sanples Differt^e?^ 


Feature Selection (Table 86) 
(Untransformed) 


K-L Transformed (Table 89) 


Feature Selection 
(untransformed) 


Same as 
above 


K-L Transformed 


Feature Selecticai 
(Untransformed) 


Same as 
above 


K-L Transformed 


Feature Selection 
(Untransformed) 


Same as 
above 


K-L Transformed 


Feature Selection 
(Untransformed) 


Same as 
above 


K-L Transformed 


Feature Selection 


88.1 


(Untransformed) 

Same as 
above 


369 

K-L Transformed 


89.4 



Feature selection cptimum subset includes channels 2, 4, 5, & 7 of the 
original 1979 IMS data set. 


K-L transformaticHi includes the first 4 components of the K-L 
transformed 1979 data set. 


Classif icatioi performance differences are based upon a Newman-Keuls 
conparison with a = 0.10. 
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Table 104. Statistical coirparison between overall classificaticKi results for the 
GML classifier for two dimaisionality reduction techniques (Feature 
Selection, K-L transformation) for the best 4 channel feature set 
based upon 1979 supervised training statistics and sample block test 
data. 



Reduction,/ 

Technique^ 

Table 


No. of 

Significant-/ 

Differaice?^ 


Locati<»i 

% Correct 

Samples 

Overall 

Classification 

Feature Selection 
(Untransfonned) 

(Table 87) 

88.1 

10,557 

Yes 

Performance 

K-L Transformed 

(Table 90) 

84.6 




^Feature selection optimum subset includes channels 2, 4, 5, & 7of the original 
1979 TNS data set. 


K-L transformaticxi includes the first 4 coirponents of the K-L transformed 1979 
data set. 

^Classification performance differaice is based upon a Newman-Keuls conparison 
with ct = 0.10. 
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Table 105, Statistical comparison between classification results for the 
GML classifier by cover class for two dimoisionality reduction 
techniques (Feature Selection, K-L transformation) for the 
best four channel feature set based w^n 1979 si^rvised 
training statistics and sample block test data. 


Cover 

Reduction, / 
Technique^ 

Table 


No. of 

Significantly 
Differ en<;^e?^ 

Class 

Location 

% Correct 

Samples 


Feature Selection 

(Table 87) 

91,0 



PINE 

(Untransformed) 



775 

No 


K-L Transformed 

(Table 90) 

92.0 




Feature Selection 


91.1 



HDWD 

(Untransformed) 

Same as 
above 


7269 

Yes 


K-L Transformed 


88.7 




Feature Selection 


58.5 



TUPE 

(Untransformed) 

Same as 
above 


118 

Yes 


K-L Transformed 


36.4 




Feature SelectiOTi 


60.5 



COJT 

(Untransformed) 

Same as 
above 


370 

NO 


K-L Transformed 


55.9 




Feature Selecticn 


82.6 



PAST 

(Untransformed) 

Same as 
above 


350 

No 


K-L Transformed 


86.3 




Feature Selecticn 


79.7 



CROP 

(Untransformed) 

Same as 
above 


369 

Yes 


K-L Transformed 


73.2 




Feature Selecticxi 


85.6 



SOIL 

(Untransformed) 

Same as 
above 


1006 

Yes 


K-L Transformed 


69.9 




Feature Selecticn 


78.7 



WATER 

(Untransformed) 

Same as 
above 


300 

No 


K-L Transformed 


81.0 




■^Feature selection c^>timum subset include channels 2, 4, 5, & 7 of the 
original 1979 QMS data set. 


K-L transformation includes the first 4 con^nents of the K-L 
transformed 1979 data set, 

2 / 

^ Classif icati(xi performance differeices are based ipon a Newman-Keuls 
con^arison with a = 0,10, 
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Table 106. Statistical ccxipurison between ovecall classification results for the 
SBCHO classifier for two dimensionality redaction tecl:Aiques (Feature 
Selection, K-^L transformation) for the best 4 channel feature set based 
upon 1979 si 5 )ervised training statistics and sample block test data. 



Reduction,. 

Technique^ 

Table 


No. of 

Significant 

Difference?^ 


Location 

% Correct 

Samples 

Overall 

Classification 

Feature Selection 
(Untransformed) 

(Table 88) 

90.0 

10,557 

Yes 

Performance 

K-L Transformed 

(Table 91) 

87.0 

. 



^Feature selecticsi c^Jtimum subset includes channels 2# 4, 5, & 7 of the original 
1979 IMS data set. 


K“L transformation includes the first 4 components of the K-L transformed 1979 
data set. 

2 / 

^Classificaticxi performance difference is based upon a Newman-Keuls con^rison 
with a = 0,10, 
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Table 107. Statistical ccsnparison betweei classificatioi results for the 
SEXMO classifier by cover class for two dimensionality 
reduction techniques (Feature Selection, K-L transformatioi) 
for the best four channel feature set based iqpon 1979 
supervised training statistics and sample block test data. 



Cover 

Reduction, / 
Technique^ 

Table 


No. of 

Significant 2 / 
Difference?^ 

Class 

Location 

% Correct 

Samples 


Feature Selectic»i 

(Table 88) 

92.9 



PINE 

(Untransformed) 



775 

No 


K-L Transformed 

(Table 91) 

92.9 




Feature Selection 


93.7 



HDWD 

(Untransformed) 

Same as 
above 


7269 

Yes 


K-L Transformed 


92.4 




Feature Selection 


57.6 



TUPE 

(Untransformed) 

Same as 
above 


118 

Yes 


K-L Transformed 


28.8 




Feature Selection 


58.9 



CCUT 

(Untransformed) 

Same as 
above 


370 

No 


K-L Transformed 


56,2 




Feature Selection 

, 

83.1 



EAST 

(Untransformed) 

Same as 
above 


350 

No 


K-L Transformed 


85.7 




Feature Selectiwi 


81.6 



CROP 

(Untransformevi) 

Same as 
above 


369 

Yes 


K-L Transformed 


71.8 




Feature Selection 


86.0 



SOIL 

(Untransformed) 

Same as 
above 


1006 

Yes 


K-L Transformed 


69.7 




Feature Selection 


79.7 



WATER 

(Untransformed) 

Same as 
above 


300 

No 


K-L Transformed 


81.0 




■^Feature selectica:i optimum subset includes channels 2, 4, 5, & 7 of the 
original 1979 OMS data set. 


K-L transformation includes the first 4 conponents of the K-L 
transformed 1979 data set. 

2 / 

^ Classification performance differences are based upon a Newman-Keuls 
comparison with a = 0.10. 
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Table 108. Statistics from original 1979 7MS data (sampled every 5th line 

and 5th column) used in cedculation of K-L transformation matrix. 


Channel 



1 . 

2 

3 

4 

5 

6 

7 

Mean Vector 

59.8 

61.4 

44.8 

128.9 

113.4 

59.9 

78.1 

Standard 

Deviation 

12.0 

18.2 

23.2 

29.5 

24.5 

24.4 

30.6 

Covariance 
Matrix Diagonal 

144.6 

330.9 

538.6 

868.6 

600.5 

596.7 

935.1 


Total Variance = 4014.90 

Correlation 


Matrix 

1.00 







0.95 

1.00 






0.90 

0.96 

1.00 





•"0.02 

0.01 

-0.07 

1.00 




0.16 

0.21 

0.19 

0.91 

1.00 



0.67 

0.74 

0.82 

0.26 

0.58 

1.00 


0.33 

0.43 

0.55 

-0.10 

0.25 

0.73 

Covariance 







Matrix 

144.6 

t 






208.5 

330.9 






252.3 

404.8 

538.6 





-5.9 

3.7 

-51.2 

868.6 




46.0 

94.9 

106.5 

657.5 

600.5 



195.4 

330.0 

463.7 

184.1 

349.2 

596.7 


122.8 

238.8 

388.7 

-87,8 

185.7 

543.2 


935.1 
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Table 109. Summary of 1979 K-L Transformed TMS Data. 


n^tm. of .i;igenyegJ^.Q££ 



OLi 


OLI 

S£LA 

OLI 

OL6. 


(X^) Eigenvector 1 

0.18140 

-0.09369 

-0.32157 

-0.26026 

0.69161 

-0.29629 

-0.47017 

( < 2 ) Eigenvector 2 

0.307 86 

-0.14271 

-0.43681 

-0.36567 

0.14429 

0.30636 

0.66758 

, \^) Eigenvector 3 

0.41710 

-0.23906 

-0.43389 

-0.02481 

-0.62199 

0.6217 

-0.43505 

( \^) Eigenvector 4 

0.21030 

0.75156 

-0.04241 

-0.34249 

-0.22526 

-0.45783 

0.10712 

(X^) Eigenvector 5 

0.33933 

0.50853 

0.09323 

0.16958 

0.20906 

0.69222 

-0.25674 

(Xg) Eigenvector 6 

0.52169 

-0.02982 

-0.03139 

0.71851 

0.13918 

-0.35212 

0.25758 

(X^) Eigenvector 7 

0.51653 

-0.29895 

0.71137 

-0.36826 

-0.01282 

-0.04145 

-0.01657 


Cumulative 

£igeavalue. pepcg at q £ .var i . an c.g ■.fs j . c .ent. _iSSE. 



2069.27 

51.54% 

51.54% 

48.46 

^2 

1357.44 

33.81 

85.35 

14.65 

^3 

501.46 

12.49 

97.84 

2.16 

^4 

58.19 

1.45 

99.29 

0.71 

'5 

14.06 

0.35 

99.64 

0.36 

'‘6 

8.72 

0.22 

99.86 

0.14 


5.77 

0.14 

100.0 

0.00 


APPE1®IX D (Tables 110-118) 


1980 and MSS Classification Results 
used in the Quantitative Evaluation of the SAR Data 
and in the SA^/MSS Comparison 


Table 110. Classification Results for the 15 m SAR Data Using the GHL Per-*Foint Classifier 






ORIGINAL 

OF POOR QUALITY 




ORIGINAL PAGE IS 
OF POOR QUALITY 



264 



FIEID PEFIPORMANCE (49/84) = j8.3% 
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Table 114. Classification Results for the 30 m SAR Data Using the Per-Field Classifier 
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Table 115. Classification Results for the 30 m SftR Data Using thf SBCHO Classifier 
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Table 117, Classification Results for the 30 m MSS Data Using the Per-Field Classifier 
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Table 118. Classification Results for the 30 m MSS Data Using the SECHO Classifier. 


' ^ , 
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